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ABSTRACT 

This  thesis  consists  of  essays  on  several  aspects  of  the 
problem  cf  algebraic  simplification  by  computer.  Since  simpli¬ 
fication  Is  at  the  core  of  most  algebraic  manipulations,  effi¬ 
cient  and  effective  simplification  procedures  are  essential  to 
building  useful  computer  systems  for  non-numerlcal  mathematics. 
Efficiency  Is  attained  through  carefully  designed  and  engineered 
algorithms,  heuristics,  and  data  types,  while  effectiveness  is 
assured  through  theoretical  considerations. 

Chapter  1  Is  an  Introduction  to  the  field  of  algebraic  ma¬ 
nipulation,  and  serves  to  place  the  following  chapters  In  per¬ 
spective. 

Chapter  2  reports  on  an  original  design  for,  and  program¬ 
ming  Implementation  of,  a  pattern  matching  system  Intended  to 
recognize  non-obvlcus  occurrences  of  patterns  within  algebraic 
expressions.  A  user  of  such  a  system  can  "teach"  the  computer 
new  simplification  rules. 

Chapter  3  reports  on  new  applications  of  standard  mathema¬ 
tical  algorithms  used  for  canonical  simplification  of  rational 
expressions.  These  applications.  In  combinations,  allow  a 
computer  system  to  contain  a  fair  amount  of  expertise  In  seve-al 
areas  of  algebraic  manipulation. 
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Chapter  4  reports  on  a  new,  practical,  canonical  slmpli- 
flcation  algorithm  for  radical  expressions  (i.e.  algebraic  ex¬ 
pressions  including  roots  of  polynomials).  The  effectiveness  of 
the  procedure  is  assured  through  proofs  of  appropriate  properties 
of  these  simplified  expressions. 

Chapter  b  Is  a  brief  summary  and  a  discussion  of  potential 
research  areas. 

Two  appendices  describe  MACSYMA,  a  computer  system  for 

symbolic  manipulation,  an  effort  of  some  dozen  researchers 

(including  the  author)  which  has  served  as  the  vehicle  for  this 
work. 
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PREFACE 

This  thesis  describes  a  number  of  contributions  to  the  art 
and  science  of  manipulating  algebraic  expressions  hy  computer. 

All  the  experiments  were  performed  using  MACSYMA,  a  computer 
system  for  symbolic  manipulation  of  algebraic  expressions  now 
under  development  at  the  Massachusetts  Institute  of  Technology's 
Project  MAC.  The  contributions  to  MACSYMA  of  some  12  people  are 
detailed  in  (31).  fly  contributions  are  as  follows. 

I  designed  and  programmed  the  rational  function  package, 
the  radical  simplifier,  the  semantic  matching  subsystem,  "SOLVE", 
the  rational  "substitution"  and  "coefficient"  routines,  and 
portions  of  the  supervisor  and  top-level  simplifier.  I  also 
designed  and  implemented  a  major  revision  of  the  polynomial 
package  incorporating  the  fast  modular  greatest  common  divisor 
algorithm  (3).  This  revision  makes  possible  the  implementation  of 
the  much  improved  factorization  algorithm  now  In  progress  (2). 

Previous  theses  which  describe  parts  of  MACSYMA  or  its 
logical  predecessors  ((30),  (35))  have  included  LISP  (33) 
listings  of  the  programs  used.  At  this  point  it  is  becoming 
impractical  to  include  such  listings,  constituting  several 
hundred  printed  pages.  Furthermore,  such  publication  is  of 
doubtful  usefulness  since  listings  and  an  operational  system  will 
be  available  in  the  near  future  to  a  community  of  users  through  the  ARPA 
computer  network.  The  system  presently  occupies  some  110,000  30- 
bit  computer  words  and  will  undoubtedly  continue  to  grow. 


5 


Work  reported  herein  was  supported  in  part  by  Project  MAC, 
an  M.l.T.  interdepartmental  laboratory  sponsored  by  the  Advanced 
Research  Projects  Agency  (ARPA),  Department  of  Defense,  under 
Office  of  Naval  Research  Contract  f!P0014-70-A-n362-0001 .  The 
author  was  supported,  while  a  graduate  student,  by  ARPA  under  Air 
Force  contract  F  1*^628— 68  —  0101  wi  th  Harvard  University,  hy  the 
National  Science  Foundation  under  their  Graduate  Traineeship  pro- 
cram  and  by  Bell  Telephone  Laboratories  under  a  contract  wi th 
Harvard  University.  Revision  of  this  thesis  prior  to  publication 
as  a  MAC  Technical  Report  has  been  supported  in  part  by  the 
Department  of  Mathematics,  M.l.T.  throuch  National  Science 
Foundation  Grant  GP  22796. 

i  wish  to  thank  Professor  Joel  Moses  of  M.l.T,  for  his 
continuing  interest,  comments,  and  suggestions,  which  have  guided 
me  in  this  work. 

i  would  also  like  to  thank  Professor  A.  G.  Oettinger  of 
Harvard,  for  providing  support,  advice,  encouragement,  and 
valuable  criticism,  while  allowing  me  considerable  freedom  in  my 
area  of  research. 


6 


CONTENTS 

Chapter  1  -  Introduction  .  g 

1.1  Algebraic  Manipulation  .  g 

1.2  Algebraic  Manipulation  by  Computer:  Prospects  and 

lealltles  . j2 

1.3  Problems  and  Goals  . 14 

1.4  Specific  Goals  of  the  Thesis  . .  18 

Chapter  2  -  The  User-Level  Semantic  Matching  Capability  In 

MACSYMA  .  23 

2.0  Introduction  and  Overview  .  23 

2.1  Predicates  and  Declarations  .  29 

2.2  Match  Definitions  .  32 

2.3  Selectors  .  35 

2.4  More  Match  Details  .  37 

2.5  Markov  Algorithms  .  42 

2.5.1  Applying  Rules  .  43 

2.5.2  An  Example  . . 44 

2.6  Advising  the  Simplifier  .  45 

2.7  Non-Commutatlve  Mul tlpl icatlon  .  50 

2.8  Comparisons  with  SCHATCHEN,  FAMOUS/  REDUCE/  Formula  Algol  53 

2.9  Differential  Equations  .  62 

2.10  Other  Applications  .  63 

2.11  Conclusions  .  54 

Appendix  I  to  Chapter  2  -  Detailed  description  of  the  match 

processor  . #t##  55 

Appendix  II  to  Chapter  2  -  LISP  listing  of  QUAD  .  73 

Appendix  1 1 1  to  Chapter  2  -  Pattern  matching  from  a 

theoretical  standpoint . . .  75 


7 


Chapter  3  -  Extending  the  Power  of  the  Rational  Function 

Facilities  . . .  32 

3.0  An  Introduction  and  a  "Political"  Digression  .  86 

3.1  Basic  Rational  Function  Commands  .  88 

3.2  Contagious  CRE  Commands  .  91 

3.3  The  Rational  Coefficient  Program . .  93 

3.4  Simple  Extensions  to  Rational  Simplification  .  96 

3.5  The  Rational  Substitution  Commands  .  98 

3.6  The  Solve  Program  . . 

3.7  Conclusions  . . 

Chapter  4  -  Simplification  of  Radical  Expressions  . 109 

4.1  Introduction  . HO 

4.2  Basic  Concepts  . . 

4.3  Radical  Polynomials  and  Expressions  . 120 

4.4  Comparisons  with  Previous  Work  on  Radical  Expressions  ..120 

4.4.1  Algebraic  approaches  . 120 

4.4.2  Zero-equi valence  tests  . 122 

4.5  Simplified  Radical  Polynomials  . 126 

4.6  Algorithms . . . 12g 

4.6.1  Removing  quotients  from  radicals  . 129 

4.6.2  Producing  a  ratio  of  radical  polynomials  . 129 

4.6.3  Simplifying  radical  polynomials  . 129 

4.6.4  Rationalizing  denominators 


132 


8 


4.7  Properties  of  the  oinpllfiecJ  . . . 

4.7.1  Overview  and  History  . . . 

4.7.2  The  Theorem  . . . 

4.8  Canonical  Forms  . ,..140 

4.9  Additional  Radical  Proposals  . . ...142 

4.10  Conclusions  . 14j 

Chapter  5  -  Summary  and  Prospects  for  the  Future  . 146 

5.1  Summary  of  Present  Capabilities  . . 

5.2  Prospects  for  the  Future  . . 

Appendix  I  -  MACSYMA  Users'  Manual  . 154 

Appendix  II  -  The  Polynomial  and  Rational  Function  Package  . 176 

Bibliography  .  lg4 


9 


Chapter  1  *  Introduction 

Many  persons  who  are  not  conversant  with  mathematical 
studies  imagine  that  because  the  business  of 
[Babbage's  Analytical  Engine]  is  to  give  its  results 
in  numerical  notation,  the  nature  of  its  processes 
must  consequently  be  arithmetical  and  numerical, 
rather  than  algebraical  and  analytical.  This  Is  an 
error,  i he  engine  can  arrange  and  combine  its 
numerical  quantities  exactly  as  if  they  were  letters 
or  any  other  general  symbols;  and  In  fact  it  might 
bring  out  its  results  in  algebraic  notation,  were 
provisions  made  accordingly. 


--Ada  Augusta,  Countess  of  Lovelace 
(1844)  ((26),  p.  1) 


During  the  past  decade,  developments  in  computer  hardware 
and  software  have  started  to  accomplish  what  Lady  Lovelace 
envisioned  over  a  century  ago.  By  dealing  with  algebraic 
expressions,  equations,  and  functions  in  terms  of  their  symbolic 
representations,  without  reference  to  specific  numerical  values, 
computers  are  aiding  working  scientists  and  engineers  facing  a 
variety  of  non-numeric  mathematical  tasks.  Borne  of  the  problems 
and  potentials  of  algebraic  manipulation  by  computer,  and  its 
most  central  process,  simplification,  are  the  topics  of  this 
thesi s. 

1.1.  Algebraic  Manipulation 

To  illustrate  the  difference  between  numeric  and  symbolic 

processing,  consider  a  FORTRAN  program  which,  given  A,  U  and  C, 

can  apply  the  quadratic  formula  to  approximate  the  roots  of 
2 

Ax  +Bx+C  *  0.  A,  B  and  C  must,  of  course,  have  numerical  values 
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at  run-time.  This  Is  strictly  numerical  processing.  If  A  had  as 
Its  run- time  value  the  expression  "Q,"  B  had  value  "(-P*Q-l)," 
ana  C  had  value  "P,"  the  FORTRAN  program  would  be  useless. 

Nevertheless,  by  applying  the  quadratic  formula  symbolically,  the 
two  roots. 


,  2  2 

_“C-  P  Q  -  1)  ♦  SQRT( P  Q  ♦  2  P  Q  ♦  1  ■  4  p  q) 

2  Q 

can  be  represented.  By  further  efforts,  this  expression  can  be 
reduced  to 

Cl  ♦  P  Q)  ♦  (1  -  P  Q) 

2  Q 

or  the  two  values  P  and  1/Q.  u„e  computer  system  for  algebraic 
manipulation  system,  MACSYMA,  which  Is  now  under  development  at 
M.l.T's  Project  MAC  (31)  and  Is  the  test-bed  for  most  of  the  work 
described  In  this  thesis,  can  be  coaxed  Into  performing  this  cal¬ 
culation  through  the  following  dialogue.  The  lines  labelled  Cl 
are  typed  by  the  user,  those  labelled  Dl  and  El  by  the  MACSYMA 
system.  (This,  along  with  most  of  the  other  examples  in  this 
thesis  consists  of  a  file  produced  directly  by  MACSYMA  which  was 
later  merged  with  the  remainder  of  the  text.) 
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(Cl)  EXP:Q*X**2-(1+P*U)*X  °«0(d 
2 

(01)  QX  -(PC  ♦  1)  X  ♦  P  ■  o 

( C2)  S0LVE(EXP,X)<s 

SOLUTION 

(E2)  X  -  - 

Q 

(E3)  X  -  P 

(03)  <E2,E3) 

It  should  be  emphasized  that  all  of  the  work  described  here 
Is  wedded  to  MACSYMA  by  convenience,  not  necessity.  The  tech¬ 
niques  which  are  considered  are  of  Interest  because  of  their 
relevance  to  mathematical  problem  solving  In  general,  and  to 
algebraic  manipulation  by  computer  most  particularly.  Although 
details  of  Implementation  will  differ,  the  algorithms  presented 
here  should  be  useful  In  a  number  of  computer  systems  now  under 
development  (1).  Since  It  serves  as  a  concrete  base  for 
comparing  our  techniques  with  those  of  other  systems,  we  will 
make  frequent  references  to  MACSYMA;  however,  the  philosophy  and 
algorithms,  rather  than  the  programs  themselves  are  really  the 
topics  of  interest.  Details  of  the  implementation  have  been 
Included  when  they  serve  to  Illustrate  particular  points  In 
dealing  with  problems  of  algebraic  manipulation. 
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1.2.  Algebraic  Manipulation  by  Computer: 

Prospects  and  Realities 

I  fully  agree  with  R.  W.  Hamming  that  "the  purpose  of 
computing  Is  Insight/  not  numbers."  ((38)/  p.  vill). 

Mathematical  analysis  has  traditionally  been  preferable  to 
numerical  approximation  techniques  because  the  resulting  exact 
symbolic  answers  often  represent  a  more  direct  path  to  insight 
than  sets  of  approximate  numbers.  In  the  search  for  insight  Into 
mathematical  and  physical  problems/  difficult  analytical  and 
algebraic  tasks  should  be  delegated  to  computers  Just  as  diffi¬ 
cult  numerical  tasks  have  been  delegated  In  the  past.  I  believe 
that  computers  can  serve  an  important  function  In  analysis  analo¬ 
gous  to  the  role  they  have  come  to  serve  both  In  bringing 
numerical  analysis  to  Its  present  state  of  refinement/  and  In 
producing  answers  to  real  problems. 

An  algebraic  manipulation  system  Is  able  to  rapidly  and 
reliably  "massage"  expressions  orders  of  magnitude  larger  than 
ones  comfortably  handled  by  humans.  For  example/  computers  have 
demonstrated  their  facility  In  handling  numbers/  hundreds  of 
digits  In  length/  and  equations  requiring  several  pages  for 
display. 

These  advantages  are  fairly  obvious.  Unfortunately/ 
attempts  to  harness  these  advantages  have  often  ignored  a  number 
of  major  problems  (detailed  below)  which  must  be  tackled  In  order 
to  provide  useful  services  to  working  mathematicians.  Most  of 
the  early  "systems"  and  "languages"  for  algebraic  manipulation. 
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having  failed  to  consider  these  problems,  disappeared  shortly 
after  their  introduction.  In  nany  cases,  the  relevant  problems 
were  not  yet  recognized,  much  less  solved.  An  unfortunately 
large  number  of  newer  efforts  In  algebraic  manipulation  systems 
have  fallen  Into  the  same  traps  <e.  g.,  (32))  and  have  not  recog¬ 
nized  the  significant  contributions  of  many  of  the  researchers  of 
the  past  ten  or  so  years.  Some  have  taken  the  attitude  that  a 
slightly  more  flexible  programming  language  Is  all  that  Is  needed 
to  suddenly  open  up  the  realm  of  algebraic  manipulation  capa¬ 
bilities.  These  researchers  (most  often  programming  language 
designers)  should  examine  their  claims  In  the  light  of  the 
Formula  Algol  (37)  experience;  namely,  that  language  features 

t 

alone,  regardless  of  their  variety,  do  not  make  a  useful 
algebraic  manipulation  system.  Algorithms  (23)  and  data 
structures  are  most  important,  and  unless  these  are  carefully 
considered,  researchers  entering  the  field  will  continue  to 
repeat  the  mistakes  of  others;  they  will  stand  on  the  feet, 
rather  than  the  shoulders,  of  the  earlier  contr ibuters. 

We  do  not  wish  to  embark  on  a  survey  of  algebraic 
manipulation  systems  since  there  are  several  easily  accessible 
references.  One  Is  the  exhaustive  annotated  bibliography  of  the 
field  begun  by  Jean  Sammet  and  continued  by  John  l/yman  (42). 
blnce  many  of  the  listed  papers  are  of  historical  interest  only 
(even  many  recent  ones,  for  the  reasons  given  above),  a  more 
selective  source  on  recent  work  Is  a  better  introduction  to  the 
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field.  W. A.  Martin/  in  (30)/  critically  surveys  the  progress  in 
algebraic  manipulation  systems  up  to  1967.  Max  Engel i ,  in  (11)/ 
gives  his  viows  on  achievements  and  problems  in  the  field  to 
196a.  it  is  an  indication  of  the  rapidity  of  change  in  the  field 
that  some  of  the  break-throughs  mentioned  by  Engel i  have  been 
eclipsed  by  more  recent  developments.  (Specifically/  calculating 
factorizations  and  greatest  common  divisors  can  now  be  done  much 
faster  than  by  using  methods  mentioned  by  Engel  1.)  Perhaps  the 
most  useful  index  to  the  field  to  this  time  is  the  "Proceedings 
of  the  Second  Symposium  on  Symbolic  and  Algebraic  Manipulation" 
(March,  1971)  (1).  it  is  a  collection  of  tutorial  and  research 
papers  describing  important  current  work  in  most  areas  of  the 
field.  Chapters  2  and  3  of  this  thesis  were  presented  at  this 
symposium  in  slightly  different  forms  (14)  (31). 

1.3.  Problems  and  Goals 

To  some  extent  the  major  problems  in  algebraic  manipulation 
depend  on  one's  viewpoint.  The  broad  view  Is  to  look  at 
algebraic  manipulation  as  a  problem  in  artificial  intelligence/ 
the  eventual  goal  being  the  construction  of  an  expert 
mathematician  (e.g.  see  (31)),  The  view  taken  here  is  much  more 
limited/  but  can  be  considered  as  a  preliminary  to  the  broader 
problem.  We  wish  to  provide  a  tool  capable  of  performing  a  wide 
range  of  services  for  a  mathematician  or  engineer.  These  can 
perhaps  best  be  envisioned  as  a  spectrum  of  facilities  ranging 
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from  a  fancy  desk-calculator,  to  (In  some  specific  areas)  an 
expert  problem  solver. 

The  system  has  facilities  for  Indefinite  precision  integer 
and  rational  number  arithmetic  and  finite  field  (modular  (9)) 
arithmetic.  In  addition  to  the  usual  floating-point  facilities  of 
a  modern  digital  computer.  It  has  the  ability  to  perform  all 
elementary  operations  on  multivariate  polynomials  and  rational 
functions.  It  Is  capable  of  factoring  polynomials,  finding  their 
greatest  common  divisors,  calculating  partial  fraction 
expansions,  derivatives  and  integrals  of  rational  functions.  It 
can  perform  routine  substitutions,  transpositions,  etc.  It 
Incorporates  the  most  efficient  algorithms  available,  and  may 
have  several  methods  for  performing  a  task,  providing  different 
types  of  efficiency,  or  efficiency  over  a  wider  domain  than  Is 
possible  with  a  single  method. 

As  we  understand  larger  classes  of  functions  and  oper¬ 
ations,  the  practical  power  of  the  system  will  be  expanded. 

Radical  expressions  (e.g.  roots  of  polynomials)  constitute  one 
class  which  has  been  added  to  MACSYMA  by  this  author.  Recent 
additions  Include  Inequalities,  polynomial  ar i thnetlc  over  finite 
fields,  and  power  series  generation  and  manipulation. 

Further  along  the  spectrum  toward  an  expert  mathematician, 
we  can  envision  an  ideal  system  as  follows.  It  understands 
scientific  notations  and  can  he  taught  special  notations.  It  Is 
clever  at  presenting  results  In  easily  readable  form.  It  can 
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understand  Instructions  (e.£.  an  algorithm  presented  as  an  Algol 
procedure)  and  follow  them  precisely.  It  can  learn  new  methods 
for  solving  problems,  but  It  already  knows  how  to  apply  a  large 
number  of  procedures  (algorithmic  and  heuristic)  which  are  useful 
for  solving  differential  equations  or  sets  of  linear  equations, 
finding  Indefinite  or  (Improper)  definite  Integrals,  limits,  etc. 
It  has  large  amounts  of  data  (e.g.  tables,  textbooks,  simpli¬ 
fication  rules)  at  Its  disposal,  and  can  be  told  to  modify  them 
for  particular  purposes.  It  will  (If  required)  save  all  Its 
calculations,  and  keep  track  of  generated  data  for  future 
references.  It  will  (If  required)  provide  additional  Information 
(e.g.  timing  data,  intermediate  results,  procedures  used)  about 
the  methods  applied  to  solve  the  problem.  It  will  work 
interactively  with  the  user,  or  perform  long  calculations 
(correctly)  In  Its  "master's"  absence,  it  understands  enough 
about  the  problem  domain  to  detect  Inconsistencies  In  its 
Instructions  and  will  balk  it  meaningless  expressions  or 
operations  (e.g.  dlvison  by  zero).  It  can  numerically  evaluate 
expressions  and  produce  plots  of  functions. 

We  do  not  pretend  that  this  view  is,  in  fact,  a  listing  of 
sufficient  components  of  a  modern  algebraic  manipulation  system, 
nor  do  we  claim  that  any  Implementation  of  such  features  will 
model  the  Internal  structure  of  a  mathematician.  We  do  feel, 
however,  that  the  facilities  noted  above  are  Important  goals  for 
a  system  like  MACSYMA.  Furthermore,  a  reasonable  number  of  these 
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goals  have  either  been  achieved#  or  are  being  approached. 

MACSYMA  is  a  large  hierarchical  computer  system  run  in  an 
interactive#  time-shared  environment.  The  real-time  response  of 
such  a  system  is#  we  believe#  necessary  if  a  computer  is  to 
assume  the  role  of  a  mathematical  assistant.  The  user  interacts 
with  MACSYMa  through  its  supervisor#  a  program  which  accepts 
character  strings  in  a  language  resembling  Algol -60.  These 
character  strings  are  parsed  into  LISP  (33)  s-expresslons  and 
passed  to  the  programming  language  evaluator.  This#  along  with 
the  general  simplifier#  forms  the  heart  of  the  system.  The 
supervisor  calls  upon  the  rest  of  the  resources  of  the  system  In 
carrying  out  the  requests  of  the  user. 

Most  commands  invoke  specific  command  programs  which  in 
turn  draw  upon  the  lower  level  routines  to  evaluate#  process# 
simplify#  and  otherwise  produce  an  answer#  which  is  then  returned 
to  the  supervisor.  The  supervisor  displays  the  answer  in  a  two- 
dimensional  textbook-like  format#  and  waits  for  the  next  user 
command.  Generally  some  side  effects  will  also  occur#  corres¬ 
ponding  to  the  assignment  of  values  to  variables#  the  definition 
of  programs#  the  secting  of  switches  affecting  future  system 
behavior#  etc.  Other  available  side-effects  include  additional 
displays  of  expressions  of  interest  and  X-Y  plots  of  numerical 
values.  The  commands  draw  on  a  wide  range  of  facilities  oriented 
about  the  several  nata  types  within  MACSYMA.  These  facilities 
include  algorithms  for  setting  up  and  manipulating  variable- 
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dimensioned  arrays  of  symbolic  elements/  algorithms  for 
performing  definite  and  Indefinite  Integration#  algorithms  for 
calculating  limits  of  functions  of  a  real  variable#  algorithms 
for  the  efficient  manipulation  of  power  series#  polynomials#  and 
rational  functions.  Additionally#  a  subsystem  for  the  Intro¬ 
duction  of  pattern-di rected  transformat  Ions  on  algebraic  expres¬ 
sions  is  included.  Appendix  I#  The  Language  and  Commands  of 
MACSYMA#  offers  specific  examples  of  the  forms  In  which  these 
facilities  are  available.  At  present#  the  desk-calculator  end  of 
the  spectrum  Is  approximated  by  the  facilities  In  MACSYMA  while 
the  more  esoteric  components  are  approximated  only  In  some  quite 
specific  areas.  Figure  1.1  Indicates#  In  basic  outline#  the 
present  components  of  MACSYMA  and  their  Interdependencies.  The 
rectangles  indicate  subsystems  which  are  still  under  development. 

1.4.  Specific  Goals  of  the  Thesis 

This  thesis  is  primarily  a  discussion  of  several 
facilities#  designed  and  Implemented  by  the  author#  which  augment 
the  abilities  of  MACaYMA#  and  In  several  cases#  provide  capabil¬ 
ities  unique  among  current  algebraic  manipulation  systems. 

Chapters  2  and  3  are  concerned  with  the  engineering  of  better 
algebraic  manipulation  systems#  while  chapter  4  presents  the 
theoretical  basis  for  some  of  the  algorithms. 

Chapter  2  discusses  a  user-level  semantic  matching 
capability#  as  impl emented  In  MACSYMA.  This  subsystem  const!- 
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compiler.  Through  this  facility  a  user  can  specify  new  Infor¬ 
mation  and  algorithms  to  the  system  In  a  manner  which  is  concise, 
general,  and  straightforward.  By  simple  top-level  cornands  to 
the  .semantic  matching  subsystem,  new  programs  are  compiled  and 
adjoined  to  the  basic  structure  of  the  system. 

By  taking  advantage  of  the  semantic  properties  of  algebraic 
expressions,  diverse  expressions  are  recognized  as  occurrences  of 
the  sane  pattern.  For  example,  a  semantic  pattern  for  ''quadratic 
In  x"  matches  both  3*x**2+4  and  (x+l)*(x+G). 

Patterns  are  created  by  declaring  variables  to  satisfy 
predicates,  and  then  composing,  out  of  these  variables,  expres¬ 
sions  which  serve  as  templates  for  the  pattern  matching  process. 

Efficiency  Is  achieved  by  compiling  programs  correspond! ng  to 
each  pattern. 

Specific  examples  show  how  this  recognition  capability  Is 
used  In  augmenting  simplification  rules  and  In  writing  algorithms 
for  the  solution  of  differential  equations. 

Other  systems  with  related  capabilities  are  compared  with 
regard  to  their  Implementations  and  matching  strategies. 

Chapter  3  Is  concerned  with  expanding  the  usefulness  of 
algebraic  manipulation  systems  by  taking  advantage  of  canonical 
simplification  programs.  In  this  case  we  refer  specifically  to 
the  rational  function  and  radical  canonical  form  facilities. 

First  the  data  types  and  basic  facilities  are  described,  and  then 
a  number  of  new  results  are  presented.  The  ease  with  which  these 
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can  be  used  Is  a  result  of  a  critical  design  decision  that  algo¬ 
rithms  (regardless  of  their  origin)  should  be  able  to  interact 
easily  with  the  special  data  types  available  in  MACS YMA .  The  new 
facilities  include  a  routine  to  solve  for  a  variable  in  an 
equation  which  is  more  powerful  (in  a  practical  sense)  than  that 
of  any  other  system;  programs  which  are  more  sophisticated  in 
their  ability  to  substitute  values  for  sub-expressions  which 
occur  implicitly  in  a  larger  expression;  and  programs,  used 
extensively  for  pattern  matching,  capable  of  finding  "coef¬ 
ficients  (suitably  defined)  in  an  expression. 

Chapter  4  describes  our  radical  canonical  simplification 
algorithm.  With  this,  many  algorithms  can  be  successfully  applied 
to  larger  classes  of  expressions  than  had  previously  been 
possible.  The  theoretical  results  behind  the  approach  are 
developed,  and  compared  to  the  work  of  Caviness  (5)  and  others. 

The  simplification  procedure  itself  is  shown  to  be  quite 
practical  (in  contrast  to  Caviness'),  and  for  many  purposes,  at 
least  as  useful.  Extensions  to  exponential  and  logarithmic 
situations  are  pointed  out  and  those  which  can  be  implemented  at 
reasonable  cost  have  been  added  to  the  algorithm. 

Chapter  5  summarizes  the  current  capabilities,  both 
theoretical  and  practical,  of  computer  aids  to  non-numer leal 
mathematics,  and  then  discusses  research  problems  which  appear  at 

this  moment  to  be  both  interesting  and  important  from  our  point 
of  view. 
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The  two  main  appendices  serve  as  documentation  for  parts  of 
the  MACSYMA  system.  They  are  not  intended  to  be  complete/  since 
MACSYMA  will  be  in  a  continual  state  of  development  for  at  least 
several  years.  Appendix  I  describes  the  outward  view  of  some  of 
the  MACSYMA  commands.  Appendix  II  describes  the  MACSYMA  rational 
function  package  in  sufficient  detail  to  make  its  transfer  to 
other  LISP  systems  simple.  The  rational  function  package  is  of 
particular  interest  in  that  it  is  self-contained/  and  sufficient 
for  many  polynomial  "crunching"  tasks.  It  includes  a  number  of 
particularly  efficient  algorithm;*/  and  may  be  of  interest  to 
mathematicians  who  prefer  to  dispense  with  the  amenities  provided 
by  a  total  system  in  order  to  make  more  core  storace  available. 
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Chapter  2 

The  User-Level  Semantic  Hatching  Capability  in  MACSYMA 
2.0  introduction  and  Overview 

When  complex  algorithms  are  coded  in  an  algebraic  manipula¬ 
tion  language/  it  is  sometimes  advantageous  to  supplement  the 
command  language  with  a  pattern  recognition  capability,  in 
effect/  a  pattern  recognition  facility  simulates  the  action  of  a 
human  mathematician  who,  by  examining  the  structure  of  a  formula/ 
decides  on  his  next  step,  it  is  to  our  advantage  to  make  this 
recognition  capability  relatively  independent  of  the  particular 
style  in  which  the  formula  is  expressed,  in  particular/  such 
details  as  whether  products  are  distributed  over  sums  or  not/ 
should,  in  some  cases/  be  irrelevant  to  the  matching  process. 

Consider  the  problem  of  solving  linear  differential  equa¬ 
tions  with  constant  coefficients.  Defore  we  can  apply  our 
knowledge  in  any  generaly  useful  manner/  we  must  he  able  to 
recognize  when  a  given  expression  is  an  equation/  a  differential 
equation/  a  linear  differential  equation/  and  a  linear 
differential  equation  with  constant  coefficients.  Because  pattern 
matching  can  perform  this  type  of  decision-making  which  might 
otherwise  require  human  intervention/  it  is  an  important  adjunct 
to  a  computer-aided  mathematical  laboratory.  Often/  only  when  the 
computer  can  recognize  a  given  pattern  and  its  components/  can  it 
proceed  to  the  next  step  in  processing.  Furthermore/  pattern- 
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matching  capabilities  are  essential  to  building  useful  additions 
to  a  mathematical  laboratory.  Through  pattern  matching,  new  sim¬ 
plification  rules  can  be  described,  non-standard  transformations 
can  be  made,  and  algorithms  extended. 

This  chapter  describes  pattern  matching  facilities  designed 
and  Implemented  by  the  author  for  MACSYMA.  Comparisons  with 
other  systems  with  regard  to  both  implementation  and  strategy  are 
included,  as  are  many  examples. 

Patterns  can  be  considered  lexical  entities,  as  in  SNOBOL 
(12).  Inside  an  algebraic  manipulation  system,  such  arbitrary 
strings  of  characters,  e.g.  /A*)(-X*,  are  rarely  useful.  The 
input-line  editor  of  MACSYMA  and  the  parser's  lexical  routines 
are  the  only  portions  of  the  system  concerned  with  more-or-less 
arbitrary  strings  of  characters. 

Patterns  can  be  considered  syntactic  entities,  as  In  FAMOUS 
16)  or  AMBIT/S  (8).  Although  syntactic  correctness  Is  necessary. 

It  Is  not  sufficient  for  algebraic  expressions  to  be  meaningful. 
For  example,  0**0  (using  FOKTKAN  notation)  is  syntactically 
correct,  but  semantically  unclear.  A  syntactic  pattern  for 
"quadratic  in  x"  would  match  expressions  of  the  form 
a*x**2  ♦  b*x  ♦  c,  but  might  fail  to  match  the  expressions  x**2 
and  (x  ♦  1 ) * ( x  ♦  6),  which  are,  however,  quadratic  functions  of 


x. 


25 


Patterns  can  be  considered  semantic  entitles,  given  a 
suitable  context,  we  will  be  concerned  primarily  with  the  context 
and  semantics  of  algebraic  expressions.  A  semantic  pattern  for 
"quadratic  In  x"  should  match  3*x**2  ♦  4  or  (x  ♦  l)*(x  ♦  f),  but 
should  not  match  a*x**2  ♦  b*x  ♦  sln(x),  which  Is  not  a  quadratic 
function  of  x. 

The  notion  of  using  the  semartlcs  of  an  algebraic  expres¬ 
sion  requires  explanation.  Some  properties  of  ordinary  addition 
and  multiplication  can  be  usefully  Included  In  the  design  of  a 
program  Intended  to  recognize  algebraic  expressions  as  Instances 
of  more  general  patterns.  For  example,  knowledge  of  the  fact 
that  addition  Is  commutative  and  has  Identity  0  and  the  fact  that 
multiplication  Is  commutative  and  has  Identity  1,  clearly 
improves  the  probability  of  finding  a  mapping  between  parts  of  a 
pattern  and  instances  of  that  pattern  In  an  expression. 

In  addition  to  these  elementary  properties.  It  is  particu¬ 
larly  useful  for  us  to  work  with  the  fact  that  for  any  poly¬ 
nomial,  P,  a  unique  form  can  be  derived  such  that  the  coefficient 
of  any  variable  In  P  to  some  Integer  power  can  be  found.  Over  a 
larger  class  of  expressions,  a  simplified  form  will  often  display 
this  characteristic  of  having  "obvious"  coefficients  with  respect 
to  sub-parts  of  the  expression. 

We  will  refer  to  these,  and  similar  properties  of  algebraic 
expressions  as  semantic  properties.  By  the  use  of  the  semantic 
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notions  already  mentioned,  a  pattern  A*X+B  might  be  matched  to 
the  expression  X,  with  A  matching  1,  and  B  matching  0. 

Additional  semantic  notions  become  more  difficult  to  choose 
(and  implement  in  a  systematic  fashion).  For  example,  interpre¬ 
tations  involving  exponents  must  be  carefully  restricted  to  avoid 
conflict.  Thus,  if  the  pattern  A**B  is  to  natch  the  expression 
1,  either  A  is  1  and  B  Is  undetermined  flc  B  is  0  and  A  is  non¬ 
zero.  Some  (somewhat  arbitrary)  decisions  concerning  acceptable 
values  for  A  and  B  are  necessary.  f.ACSYMA  makes  such  a  decision, 
which  is  described  in  the  first  appendix  to  this  chapter. 

We  have  chosen  to  implement  the  arithmetic  interpretations 
of  our  matching  programs  using  basically  these  semantic  notions. 

A  less  elaborate  interpretation  would  prevent  us  from 
matching  a  pattern  A*X+B  to  the  expression  X,  with  A  matching  1 
and  B  matching  0. 

A  more  expansive  interpretation  of  the  possibilities  leads 
into  difficulties:  allowing  the  coefficient  of  X**3  in  the 
expression  X**2  to  be  1/X;  allowing  2**n  to  match  the  expression 
0  with  n  matching  negative  Infinity,  etc. 

The  exact  limits  chosen  for  any  given  implementation's 
ability  to  enlarge  upon  the  elemental  syntactic  statement  of  a 
pattern  has  been,  and  will,  no  doubt,  continue  to  be  largely 
pragmatic.  Furthermore,  it  is  our  belief  that  any  attempt  to 
produce  a  concise  formalism  for  a  pattern  matching  interpreter  is 
bound  to  unnecessarily  limit  the  power  of  the  implementation. 


27 


Those  Hatching  formalisms  cannot  take  advantage  of  the  many 
useful,  but  non-systematlc  "tricks"  which  can  be  cleanly  added  to 
a  pattern  matching  program.  Therefore  we  will  continue  to  take  a 
pragmatic  approach  to  semantic  pattern  matching,  and  try  to 
reveal  the  reasoning  behind  our  design  features,  and  the  methods 
used  to  implement  them. 

We  will  refer  to  those  pattern  matching  programs  with 
facilities  which  take  into  account  at  least  the  basic  properties 
of  addition  and  multiplication,  as  semantic. 

Historically,  Slagle's  SAINT  (43)  and  Hoses'  SIN  (35)  were 
the  first  demonstrations  of  a  significant  application  of  semantic 
pattern  matching:  large  classes  of  expressions  were  mapped  Into 
forms  with  known  Integrals.  Other,  more  general  applications, 
some  of  which  are  detailed  below,  range  from  adding  new  opera¬ 
tions  and  simplifications  to  an  algebraic  manipulation  system,  to 
recognizing  and  solving  special  cases  of  differential  equations. 

The  facilities  used  for  pattern  matching  by  Slagle  and 
Moses  were  not  user-oriented.  By  contrast,  the  programs  described 
here  give  the  HACSYMA  user  a  powerful  and  sophisticated  semantic 
matching  capability,  and  the  tools  by  which  he  can  introduce 
these  capabilities  Into  the  command  level  of  the  system  and  into 
his  wn  programs.  Of  the  other  algebraic  manipulation  systems 
currently  in  use,  it  appears  that  only  Hearn's  REDUCE  (19)  has  a 
user-level  matching  facility.  REDUCE  gives  the  user  (through  the 
LET  command)  a  limited  matching  facility  which  is  considerably 
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restricted  In  its  power  by  Its  emphasis  on  efficiency.  For 
example/  patterns  which  are  sums  are  not  permitted.  FAMOUS  (16) 
and  Formula  Algol  (37)/  neither  of  which  is  currently  in  use/ 
provided  matching  facilities/  which  (as  we  shall  see  In  section 
8)/  were  syntactic/  rather  than  semantic  in  approach. 

In  sections  1  to  4,  methods  for  defining  patterns  in 
MACSYMA  are  described/  largely  through  examples.  Section  5 
discusses  MACSYMA' s  Markov  al gor i thm-styl e  (pattern-replacement) 
programming  facility.  Section  6  considers  the  problem  of 
introducing  new  simplification  rules  into  MACSYMA  efficiently  and 
effectively.  Section  7  demonstrates  how  these  techniques  can  be 
used  to  introduce  rules  for  non-commutative  multiplication.  Sec¬ 
tion  8  critically  examines  the  pattern-matching  facilities  of 
SCHATCHEN/  REDUCE/  FAMOUS,  and  Formula  Algol,  and  compares  them 
to  MACSYMA' s  facility.  Questions  of  strategy  and  implementation 
are  considered.  Section  9  considers  applications  of  pattern 
matching  to  solving  differential  equations.  Section  10  suggests 
other  areas  of  usefulness  in  mathematics  and  man-machine 
communication.  These  sections  are  supplemented  by  appendices  t° 
Jjlifi  chapter:  Appendix  1  contains  precise,  extended  definitions 
of  the  matching  procedures;  Appendix  II  includes  an  example  of  a 
match  program  as  compiled  by  the  system;  Appendix  III  considers 
the  problem  of  defining  classes  of  expressions  over  which 
matching  procedures  can  be  considered  effective  --  that  is,  under 
what  circumstances  a  pattern  match  can  determine  membership  in 
formally  defined  classes  of  algebraic  expressions. 
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2.1.  Predicates  and  Declarations 
An  intuitive  pattern  for  a  quadratic  in  x  is 
a*x**2  ♦  B*x  ♦  C  where  A/  B,  and  C  are  pattern  variables  which 
can  natch  numbers  or  other  expressions  free  of  the  variable  x.  In 
addition,  A  must  not  match  zero,  otherwise  linear  expressions 
would  be  included  in  the  domain  of  the  pattern. 

Clearly  we  must  be  able  to  Insist  that  variables  In  a  pat¬ 
tern  have  certain  characteristics  (e.g.  are  nonzero  or  are  free 
of  x);  that  is  ii£  must  be  able  Ifl  make  the  success  &£  £  malCJh 
dependent  fin  Ihs.  matched  values  satisfying  predicates. 

Predicates  (for  our  purposes)  are  prograns  which  return  either 
TRUE  or  FALSE.  In  practice,  we  consider  anything  other  than  FALSE 
as  TRUE.  Patterns  themselves  are  predicates  since  they  return 
FALSE  If  applied  to  a  non-r.iatchlng  expression.  Predicates  can 
take  any  number  of  arguments  (usually  at  least  one)  and  can  be 
defined  In  LISP,  (In  which  MACSYMA  Itself  Is  written)  or  In  the 
liACSYMA  programming  language,  which  resembles  Algol  60. 

FREE0F(X,Y)  Is  a  predicate  with  two  arguments,  X  and  Y, 
which  answers  the  question,  "Does  the  expression  Y  depend  expli¬ 
citly  on  the  variable  X?"  Thus  FREC0F( A,A**2+D)  ’s  FALSE; 

FKEEOF ( A, OS I N(D) )  is  TRUE.  TRUE(X)  is  a  predicate  which  Is 
always  TRUE.  This  is  useful  because  it  Is  convenient  to  allow 
some  variables  to  match  anything.  INT(X)  is  TRUE  when  X  Is  an 
i nteger. 
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FREEOF,  TRUE#  and  I  NT  are  already  defined  in  the  standard 
MACSYMA  system.  We  might  define  NONZERO  by  the  program: 

NONZEKO(X) :■  IF  X-0  THEN  FALSE  ELSE  TRUEQ. 

The  function  SIGNUM(X)  returns  -1,  0  or  +1  respectively  if 
X  <  0/  X  ■  0/  or  X  >  0.  SIGNUM#  we  should  note,  expands  its 
argument  using  MACSYMA* s  rational  function  routines  (see  Chapter 
3).  This  produces  a  form  v/hich  is  canonical  over  rational  func~ 
tlons  Cup  to  the  order  of  the  variables)  and  allows  us  to 
uniquely  determine  a  sign  for  the  coefficient  of  the  highest 
power  of  the  main  variable  (in  the  numerator).  Thus  it  knows  that 
the  following  expressions  are  negative:  -4,  -X,  -X  -  Y,  -Cl  ♦  X). 

Whether  X  -  Y  is  negative  or  not  depends  on  which  variable  (X  or 
Y)  the  rational  function  package  has  been  told  is  the  main 
variable.  It  wl 1 1  choose  a  main  variable  itself  If  necessary. 

The  only  expression  whose  SIGNUM  is  0  is  0.  Using  SIGNUM  we 
can  define: 

NEGATI  VEPREU(X)  :*  IF  S I  GNUM(X)«-1  THEN  TRUE  ELSE  FALSEQ. 

A  few  more  predicates  which  are  used  in  examples  to  follow 

are: 

I NRANGEC  L0W,HI , VAR)  :«  IF  (LOW  <  VAR)  AND  (VAR  <  HI)  THEN  TRUE 
ELSE  FALSEQ 

NONZERUANUFKEEOF ( X, Y )  :■  IF  NONZERO(Y)  THEN  FREE0F(X#Y)  ELSE 
FALSEu. 

To  associate  a  pattern  variable  with  a  predicate#  we  have 
the  DECLARE  command.  I t  has  the  form: 

DECLARE (naue.oredicateCarg^  ...,  arr.(  ))C.  (n  2  C) 
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For  example/ 

DECLAKE( A/ FREEOF(X) )U 
DECLAKE( A/ I NRANGE(N/M) ) 0 
DECLARE(A/TRUE)U 

Note  that  the  last  argument  of  each  predicate  Is  missing 
from  the  declaration.  The  value  matching  the  declared  variable 
will  serve  as  the  final  actual  argument.  Thus  If  A  were  declared 
NONZERO  and  an  attempt  were  made  to  match  A  with  X**2  ♦  3#  then 
NUNZEK0(X**2  ♦  3)  would  be  evaluated.  Since  the  result  would  be 
TRUE/  the  match  would  be  successful/  and  A  would  be  assigned  the 
value  X**2  ♦  3. 

The  binding  times  of  the  arguments  to  DECLARE  must  be 
clarified.  The  first  argument  Is  not  evaluated;  thus 
L)ECLARE( A/ . , )  affects  the  declaration  of  A#  even  if  the  value  of 
A  Is  li  +  2,  The  second  (predicate)  argument  to  DECLARE  Is 
treated  os  an  undefined  function:  If  we  were  to  change  the 
definition  of  INRANGE  to  some  other  function  of  three  argument;*/ 
It  would  not  be  necessary  to  redeclare  A.  The  extra  arguments  to 
the  predicate  (.aCitj/  .../  ai£n)  are  bound  at  the  time  the 
predicate  Is  applied.  Thus  If  A  were  declared  to  be  FREEOF(X), 
and  the  value  of  X  at  some  later  time  were  Zt  an  attempt  to  match 
A  current  with  that  assignment  would  invoke  a  test  to  see  if  the 
potential  match  for  A  were  dependent  on  Z. 
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2.2.  Match  Definitions 

The  DEFMATCM  command  defines  a  new  program  (a  predicate) 
which  will  succeed  only  if  a  particular  semantic  pattern  is 
matched.  The  DEFMATCH  command  has  the  form: 

DEFMATCH(flilttfiramnamg/  Pattern/  catternvar^.  ...,  natternvar  Ifi 

(n  1  0). 

For  example/ 

DEFMATCM( LINEAR,  A*X  ♦  B  ,  X)Q 
DEFMATCH(F3,  X+  3  ♦  F(X,Y,5),  Y) Q 
DEFMATCIK  CUSS  IMP,  COS(N*PI)  ){i 

these  examples  will  have  different  interpretations 
depending  on  the  declarations  (or  lack  of  declarations)  for 
A/B/X/N,  and  F.  The  result  in  jach  case  will  be  a  program  with 
name  PT.fljtraWiaPie  <e.g.  LINEAR,  F3,  COSSiMP)  which  will  test  to 
see  If  the  pattern  pattern  <*.e.  A*X  ♦  B,  etc.)  can  be  applied  to 
it*  first  argument.  The  program  will  have  n  additional  arguments, 
correspond! nc  to  the  patternvars . 

During  the  execution  of  these  resulting  programs, 
undeclared  variables  (l.e.,  those  variables  not  appearing  as  the 
first  argument  In  a  DECLARE  command)  In  the  pattern  are  lambda- 
bound  to  the  values  in  the  program  invocation  If  their  names  are 
among  those  variables  listed  in  the  DEFMATCH  command.  Variables 
not  listed  among  the  patternvar  are  bound  to  their  values  In 

i 

the  environment  at  execution  time.  At  the  successful  conclusion 
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of  a  match/  declared  variables  will  be  assigned  the  values  that 
they  match/  and  a  list  of  the  associations  of  variables  and  their 
values  is  returned. 

An  extended  example  should  clarify  this.  The  lines  labelled 

Ci  are  typed  by  the  user/  the  lines  labelled  l)i  are  typed  by  the 

computer.  Lines  terminated  by  a  $  suppress  printing  of  the 

result.  Lines  terminated  by  an  G  result  in  a  computer  generated 

display  of  the  answer. 

(Cl)  DECLARE(A/NONZEROANDFREEOF(X))$ 

( C 2 )  UECLARE(B/ FREEOF(X) )$ 

(C3)  DEFMATCH(LiNEAR/A*A+B/X)G 
(D3)  LINEAR 

( C4)  Li NEAR( 3*Y+4/ Y) (b 
(D4)  (U  "  4/A  -  3/X  -  Y) 

(C5)  LI NEAR(Z*Y+4+X/ Y) y 
(05)  (B  -  X  ♦  4/A  -  Z/  X  -  Y) 

At  this  point  the  value  of  A  Is  Z/  the  value  of  B  is  X  ♦  4. 

If  the  value  of  X  previous  to  line  C5  had  been  4/  the  answer 

would  have  been  (B  ■  8/ A  ■  Z,  X  ■  Y). 

The  X  on  line  04  is  a  completely  separate  entity  from  the  X 

on  line  C5/  in  that  the  first  is  like  a  formal  parameter  to  a 

subroutine/  and  the  latter  is  a  global  variable  with  the  same 

name.  This  distinction  should  be  apparent  on  line  05. 

The  oatternvar 1  &  may  appear  In  the  declarations  also. 

Thus: 

(CC)  DECLARE(A/ INRANGE(N/M))$ 

(C7)  0EFMATCH( BETWEEN/ A/ N/M)G 
A 

IS  THE  PATTERN 

(C8)  BETWEEN(5/1/6)U 
(08)  (A  -  5/N  «  1/M  -  6) 


34 


The  message  following  line  C7  is  from  the  DEFMATCH 
compiler/  indicating  that  it  had  evaluated  A  to  see  if  perhaps 
A's  value  was  the  intended  pattern,  in  this  case/  the  value  of  A 
was  A/  thus  the  message/  "A  IS  THE  PATTERN"  is  printed.  The 
PdLLern  in  the  DEFMATCH  command  is  generally  not  evaluated/  since 
this  (with  its  substitution  of  values  for  variables)  tends  to 
make  patterns  disappear.  However/  if  (as  in  this  example)  the 
Pd.LLe.rn  is  an  "atom/"  or  single  variable/  then  It  Jj*  evaluated. 
This  allows  a  user  to  compose  an  elaborate  pattern/  say  as  a 
result  of  a  computation/  and  then  give  its  name  to  the  DEFMATCH 
command/  rather  than  having  to  type  it  in  ail  at  once.  If  A  had 
had  the  value  b  ♦  4/  the  message  "B  ♦  4  IS  THE  PATTERN"  would 
have  been  printed. 

Now  that  we  have  shown  how  pattern  programs  are  defined/  we 

can  clarify  the  use  of  the  predicate  TRUE.  Recall  that  declaring 

A  to  be  TRUE  means  that  A  in  a  pattern  will  match  anything 

occupying  the  appropriate  position  in  the  expression.  Thus 

( C9)  DECLARE( A/TRUE) $ 

(CIO)  DECLARE(0/TRUE)$ 

(Cll)  DEFMATCH(G/ A*X+B*Y)jj 
( C12)  G(3*X+l*Y+J*X)u 
(012)  (B  -  i/A  -  J  ♦  3) 

This  illustrates  another  principle  in  matching  patterns. 

II  A  ia  undeclared  dad  nai  a  pattern  variable,  a  1q  ^  pattern 
viLLL  Lia-tah  only  A'd  current  value,  (if  A  has  no  value,  then 
MACSYMA  provides  "A"  for  the  value  of  A.  As  a  special  case/ 
cun 5 tan ts  natch  only  themselves.) 
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2.3.  Selectors 

Sometimes  it  is  not  sufficient  to  find  out  whether  or  not  a 
predicate  succeeds  on  a  given  argument.  Sometimes  we  wish  to  not 
only  test#  but  separate  components  of  a  pattern  which  in  ordinary 
circumstances  would  remain  indivisible.  We  wish  to  permit  a 
special  form  of  predicate  which  (1)  confirms  that  a  subexpression 
satisfies  a  predicate#  and  then  (2)  hands  back  to  the  pattern 
program  more  information  than  just  "the  predicate  succeeded."  We 
will  call  such  programs#  when  used  in  the  place  of  predicates# 
selectors.  The  selectors  that  are  of  the  greatest  interest  to  us 
here  always  "succeed"  in  one  *orm  or  another#  but  in  so  doing# 
return  a  particular  part  of  the  expression  which  is  being 
matched.  Aiding  us  in  this  venture  is  the  convention  that  any 
result  which  is  not  "FALSE"  is  true. 

Consider  the  predicate  INTEGER,  it  returns  TRUE  when 
applied  to  an  integer.  A  corresponding  predefined  selector# 

WHOLE#  returns  only  the  integer  part  of  a  number.  Another 
selector#  FKACTIONPART#  might  be  defined: 

FRACTION PART(X)  :«  X  -  WH0LE(X)5 

It  would  then  have  to  be  designated  a  selector  by: 

S  E  LE CTOR ( F R ACT  I OHPART ) $ . 

A  dialogue  would  look  like  this: 

CC1)  FRACTIGNPART(X)  :«  X  -  WHOLF.(X)$ 

( C2)  SELECTOR (FRACTION PART)* 

( C5)  DECLAKE( A/WHOLE) $ 

( C4)  D EC LARE ( 0 # FRACTION PART )$ 
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(C5)  DEFfiATClK  SEPARATE#  A  ♦  B)$ 

B 

MATCHES  ALL  iN 
B  ♦  A 

( C6) SEPARATE (5/ 2) Q 

1 

(D6)  (A  -  2 #B  »  -) 

2 

The  Message  following  line  C5  would  normally  indicate  an 

error.  Here  it  signifies  that  B's  predicate  (or  selector)  will  be 

applied  to  what  Is  left  after  A's  predicate  (or  selector)  is 

applied.  Here#  this  is  what  Is  intended#  but  note  that  if  both  A 

and  B  had  only  predicates#  SEPARATE  would  natch  one  of  them  to  0 

in  every  case.  The  following  caution  should  be  observed:  if  a 

selector  is  used#  a  complementary  selector  should  generally  be 

used  with  it#  since#  for  example# 

( C7)  DEFf IATCH( F3# A) $ 

A 

IS  THE  PATTERN 

(C8)  F3( 5/2) Q 
(D8)  (A  -  2) 

results.  The  "f ractionpart"  has  (perhaps  unintentionally)  been 
di scarded. 

Another  selector  provided  by  MACSYMA  is  NUMFACTOR#  which 
selects  the  numerical  factor  from  a  product  (or  1#  otherwise).  A 
complementary  selector#  OTHERFACTOR  might  be  defined  by 
OTHEUFACTOR(X)  :«  X/NUMFACTOR( X ) $ 
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Other  selectors  provide  facilities  for  picking  out  Items  In 
a  sum  or  product  one  by  one*  The  notion  of  "extractor"  In  Formula 
Algol  Is  weaker  than  this.  In  that  extractors  can  only  be  used  to 
attach  labels  to  syntactically  distinguishable  subexpressions. 
Thus  the  numerator  of  a  fraction  can  be  labelled  through 
"extraction"  but  the  "whole  part"  of  a  ratio  of  two  numbers 
cannot  be  labelled  through  Formula  Algol. 

2.4.  More  Match  Details 

Patterns  can  be  more  complicated.  For  example,  with  A  and  B 
declared  TKUE,  the  pattern  3**A  ♦  B**4  will  match 

w**4  ♦  3**z  wl  th  A  *  7.  ,  B  ■  w 

w**4  +1  with  A  »  0  ,  B  •  w 

3**z  wl th  A  »  z  ,  B»0 

3  with  A  »  1  ,  B  ■  0 

1  with  A  -  0  ,  B  »  0. 

The  expression  10,  (which  Is  3**2  ♦  1**4)  will  not  match. 
The  exact  limitations  of  the  exponentiation  treatment  are 
described  in  this  chapter’s  Appendix  I. 

Anv  pattern,  or  part  &£  £  pattern.  £  which  1&  entirely  free 
of  variables  which  are  declared  and  ££  vet  unmatched  wLU.  match 
anv  expression  £  such  that  (when  al  1  free  variables  £L£  glVJ&n 
thel r  ass  I cned  values)  £”£"£.  To  some  extent  this  type  of 
match  depends  on  what  algorithm  Is  used  to  simplify  the  result  of 
the  subtraction.  Ordinarily  the  MACSYMA  simplifier  is  used,  but 
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rational  simplification  (see  Chapter  3)  Is  used  when  coefficients 
are  being  picked  off,  since  expansion  Is  often  needed  to  produce 
proper  results.  We  feel  this  is  very  Important  if  we  are  to  abide 
by  our  belief  that  the  semantics  of  the  expression,  rather  than 
the  syntax,  is  the  important  aspect  to  model  In  pattern  matching. 
Thus  the  following  dialogue  Is  possible: 


(Cl)  UECLARE(A,  NONZEROA?JDFREEOF(  X)  )$ 
( C2)  DECLArtE(B,FKEEOF(X) ) $ 

( C3)  UECLARE(C,FREEOF(X))$ 

( C4)  DEFMATCH( QUAD, A*X**2  ♦  B*X  ♦  C 

(C5)  QUAD( (Z+1)*(2+2),2)U 

<°5)  (C  «  2,  B  «  7,A  =  1,X  -  Z) 


Rational  simplification  must  be  used  to  compute  (Z+l)*(Z+2)  - 
(Z**2+3*Z+2),  to  convince  QUAD  that  the  match  has  succeeded.  This 
Is  the  only  effective  r.  hod  at  our  disposal  If  we  wish  to 
implement  such  matches  as  C5.  The  additional  rational  simpli¬ 
fication  Is  not  particularly  inefficient,  since  the  coefficient 
routines  described  in  Chapter  3  have  already  converted  the 
expression  to  a  canonical  rational  form. 

OEFMATCH  has  produced  in  QUAD  a  program  which  operates  as 
follows.  QUAD(E,  X) 

a.  Picks  out  the  coefficient  of  X**2  in  E,  and  If  the  coefficient 

1*  A  an<*  non-zero,  assigns  it  to  A,  otherwise  returns 

b.  bets  E  to  E  -  A*X**2 

C#  the  ?°effjcIent  of  X  in  E,  and  if  the  coefficient  Is 

tree  of  X,  assigns  It  to  H,  otherwise  returns  FALSE. 

d.  bets  E  to  E  -  B*X 

e.  If  E  is  free  of  X,  assigns  E  to  C  and  returns  a  list  of  the 
values  A,  B,  and  C,  otherwise  returns  FALSE. 
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Implicit  in  this  algorithm  are  several  basic  principles  of 
semantic  pattern  matching.  For  example,  line  (C5)  above  demon¬ 
strates  that  coefficients  Id  AD  expression  should  extracted 
semantical  1v  d.e.  the  coefficient  of  Z  must  be  extracted  using 
the  semantics  of  the  operators  ♦  and  *). 

( C6)  QUAU( 3*X**2+4,X)Q 

(C  «  3,15  »  0, A  -  3) 

Line  ( C6)  demonstrates  that  summands  Xd  lilfi  pattern  which 

ACfi  fllssinfi  ID  Ihfi  DAPreSSion  ADS  matched  with  £.  This  is  what 

happened  to  the  term  B*X  in  the  QUAD  pattern.  Furthermore,  1£  A 

product  la  matched  iiLtli  n,  ana  of  Its.  factors  must  match  £.  Thus 

for  B*X  to  match  0,  B  must  match  0. 

( C7)  QUA0(X**2+3*X+4,X)Q 
(07)  (C  -  4, B  -  3, A  -  1) 

•  hat  is,  factors  Id  ihe  pattern  iihlch  ad e  missing  in  the  exnres- 
aifiD  ADD  matched  tilth  1.  This  assigns  to  A  the  value  1. 

Since  DFFMATCH  actually  produces  short  programs  (e.g. 

QUAD),  the  matching  programs  may  be  compiled  by  a  LISP  compiler 
into  machine  code  for  increased  speed.  The  program,  QUAD,  pro¬ 
duced  above,  is  shown  in  this  chapter's  Appendix  ii. 

To  help  prevent  the  user  from  asking  for  ambiguous  matches 
(where  they  can  be  detected),  the  match  compiler  used  by  DEFMATCH 
has  a  number  of  warning  messages.  Generally  they  indicate  points 
where  there  is  a  likelihood  that  the  user  has  submitted  a  pattern 
which  is  ambiguous,  or  could  be  more  suitably  constructed  for 
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optimal  matching,  in  general/  patterns  should  be  expanded  so  that 
the  full  freedom  of  commutative  operators  can  be  exploited.  The 
pattern  x**2-y**2  will  match  a  wider  range  of  expressions  than 
the  pattern  (x*y)*(x-y).  The  latter  will  match  only  expressions 
which  are  the  product  of  two  sums  of  the  specific  syntactic  form 
used.  This  asymmetry  with  respect  to  patterns  and  expressions 
(the  expressions  x**2-y**2  and  (x+y)*(x-y)  will  be  treated 
identically  by  most  pattern  programs)  is  a  consequence  of  the 
fact  that  it  is  far  easier  to  multiply  out  sums  and  pick  out 
coefficients/  than  it  is  to  factor  polynomials.  We  allow  either 
pattern  however/  since  it  is  possible  that  the  latter/  strictly 
syntactic  match  (like  those  available  In  Formula  Algol  or  FAMOUS) 
might  be  of  some  use  anyway. 

Since  backing  up  (i.e./  abandoning  assignments  of  values 
and  trying  new  ones)  is  not  done  in  the  matching  process/  the 
user  should  consider  whether  his  intentions  will  he  properly 
represented.  While  a  back-up  algorithm  could  have  been  adopted/ 
the  potentially  great  increase  in  cost/  combined  with  no 
assurance  that  the  user  would  be  happy  anyway/  make  such  an 
approach  somewhat  unattractive,  (it  should  be  said/  however/ 
that  in  cases  where  heuristics  and  back-up  are  part  of  the 
processing  itself/  as  in  early  stages  of  SIN  (35)  it  may  be 
convenient  to  use  the  pattern  matching  program  for  the  basis  of 
heuristics.)  There  is  the  further  argument  that  pattern-match 
problems  can  be  easily  constructed  which  are  undecidable  (in  the 
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Tur Ing-Church  sense),  so  back-up  will  not  solve  all  our  problems. 
SCHATCHEN  uses  back-up;  back-up  is  expensive,  and  as  is  demon¬ 
strated  by  the  examples  in  this  paper,  the  lack  of  back-up  is 
often  not  even  noticed.  This  is  discussed  further  in  section  C. 


An  example  which  demonstrates  how  backing-up  might  be 

implied  by  a  pattern  follows: 

(Cl)  DECLAKE(A,TRUE)$ 

( C 2 )  UECLARE(B,FREEOF(Y))* 

( C3)  DEFMATCH(NEED6ACKUP,  S J N( A)+S J N( B) ) $ 

(C4)  NEEDBACKUP(S I N(X)+S I  N(  Y) ) $ 


The  final  line  may  match  with  (A  ■  Y,  B  ■  X);  but.  If  A  >  X 
Is  tried  first  (succeeding),  and  then  B  -  Y  Is  attempted,  the 
pattern  will  fail. 

One  method  of  circumventing  this  difficulty  is  as  follows: 
(RETLIST  returns  its  argument  list  as  a  sequence  of  equations, 

,s  the  assignment  operator,  and  [J  is  used  to  enclose  a  list 
consisting  of  local  (i.®.,  "dummy")  variables  within  a  BLOCK.) 


(Cl)  DECLARE( A, TRUE) $ 

( C2)  DECLARED,  TRUE)* 

( C3)  DEFMATCH(PAT,SIN(A)+SIN(B))$ 

(C4)  DOESBACKUP(Z) : *  I F  PAT(Z)-FALSE  THEN  FALSE 
ELSE  IF  FREEOF( Y,B)  THEN  RETLIST(A,B) 

ELSE  BLOCK  ([TEMP], 

TEMP:  A, 

A:  B, 


B.-TEMP, 

RETLiST(A,B))$ 


The  purpose  of  the  fancy  ELSE  clause  In  C4  is  to  reverse  the 
assignment  of  values  to  A  and  B  In  the  returned  list.  Thus,  while 
a  conscious  design  decision  was  made  to  prevent  back-up,  the 
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possibility  of  simulating  It,  when  necessary.  Is  available. 

The  fact  that  we  insist  on  completely  directed  or 
"anchored"  (12)  searches  In  a  pattern  is  both  a  strength  and  a 
weakness.  Some  patterns  are  inherently  ambiguous,  and  all 
possible  types  of  matches  must  be  explored.  This  Is  the  case  in 
symbolic  integration.  If  such  ambiguous  patterns  are  the  rule, 
rather  than  the  exception,  we  would  be  seriously  inconvenienced 
by  having  to  simulate  back-up  (as  above),  in  every  case. 

Arbitrary  n-ary  functions  may  be  used  in  a  pattern,  as  is 
i 1  lustrated  below: 

(Cl)  UECLARE(F,TRUE)$ 

( C 2 )  DECLARED,  TRUE)* 

(C3)  DECLARED, TRUE)* 

( C4)  DEFMATCiU F2, F(X, Y) )$ 

( C5)  F2( P01NT( 3, 4) ) £ 

(D5)  (Y  ■  4, X  ■  3, F  -  POINT) 

It  is  also  possible  to  execute 

(CO  F207+4)  £ 

<Dt»)  (Y  ■  W,  X  ■  4,  F  -  MPLUS ) 

This  gives  a  facility  for  explicitly  matching  operators, 
if,  for  example,  F  is  declared  to  match  only  MPLUS.  This  facility 
could  be  used  to  simulate  simpler  styles  of  pattern  matching 
which  are  completely  syntax  based. 

2.5,  Markov  Algorithms 

Users  of  a  mathematical  laboratory  may  find  that  certain 
algorithms  lend  themselves  to  an  organization  based  on  the  Markov 
algorithm  formalism:  a  list  of  rules,  each  consisting  of  a 
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pattern-replacement  pair  Is  applied  to  an  expression.  FAMOUS 
(16),  PANON-IB  (7),  AMBIT/S  (8),  Formula  Algol  (37),  and  SNOBOL 
(12),  among  others,  are  based  on  such  a  formalism.  In  order  to 
allow  MACSYMA  algorithms  to  be  written  In  such  a  style,  a  command 
to  define  rules,  DEFKULE,  is  provided,  along  with  sequencing 
algorithms.  The  form  of  the  DEFRULE  command  Is: 

DEI  RULE(jm.|finaflie/flattern.reDlacementlia. 

If  the  rule  named  nUananm  Is  applied  to  an  expression  (by 
one  of  the  APPLY  programs  below),  every  subexpression  matching 
the  Pattern  will  be  replaced  by  the  replacement.  All  variables  In 
the  replacement  Which  have  been  assigned  values  by  the  pattern 
match  are  assigned  those  values  in  the  .replacement  which  is  then 
simplified.  The  rules  themselves  can  be  treated  as  programs  which 
will  transform  an  expression  by  one  operation  of  pattern-match 

and  replacement,  if  the  pattern  fails,  the  value  of  the  rule  Is 
FALSE. 

2.5.1  Applying  Rules 

Each  of  the  programs  described  in  this  section  applies  Its 
rules  to  the  expression  Indicated  by  Its  first  argument, 

recursively  on  that  expression  and  Its  subexpressions,  from  the 
top  down. 

APPLYl(£,£t,  £2,...,£n)  applies  the  first  rule,  £,,  to  the 
expression  *  until  It  falls,  and  then  recursively  applies  the 
same  rule  to  the  subexpressions  of  that  expression,  lef;-to- 
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right/  until  the  first  rule  has  failed  on  all  subexpressions. 

Then  the  second  rule  is  applied  in  the  same  fashion.  ‘When  the 
final  rule  fails  on  the  final  subexpression/  the  application  is 
finished. 

APPLY2($,  £,  ,£z,  . .  .,£n)  differs  from  APPLY1  In  that  If  the 
first  rule/  £  fails  on  a  given  subexpression/  then  the  second  is 
applied/  etc.  Only  if  they  all  fall  on  a  given  subexpression  is 
the  whole  set  of  rules  applied  to  the  next  subexpression,  if  one 
of  the  rules  succeeds/  then  the  same  subexpression  is  repro¬ 
cessed/  starting  with  the  first  rule. 

APPLY1  corresponds  to  Formula  Algol's  (23)/  (37)  one-by-one 
sequencing  mode/  and  APPLY2  corresponds  to  its  parallel 
sequencing  mode  (with  the  inessential  difference  that  Formula 
Algol  processes  from  right  to  left). 

Thus  If  Rl/  R2/  R3/  and  R4  are  rules  defined  by  DEFRULE,  a 
program  might  be  written  using  them  as  follows: 

PROGRAM(X) : "APPLY 1( APPLY 2 ( X/R3/  R4)/R1/R2)$ 

and  the  Markov-style  algorithm  represented  by  PROGRAM  could  be 
executed  on  the  expression  Y  by 
Z:  PROGRAM Y)@ 

2.5.2  An  Example 

Here  is  an  example  of  using  rules  to  alter  an  expression. 
The  symbol  S  is  used  as  an  abbreviation  for  eZ,  RATS  IMP  (see 
Chapter  3)  expands  an  expression  into  a  ratio  of  polynomials  and 
cancels  common  factors,  and  the  symbol  i  always  denotes  the  most 
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recently  displayed  expression. 


(Cl)  DEFRULE(R1,SECH(Z),  .’./COSH(Z) ) $ 

(C2)  DEFRULE(R2/TANH(Z)/S i NH(Z)/C0SH(Z) )$ 
(C3)  DEFRULE(R3/SINH(Z)/(S-l/S)/2)$ 

( C4)  DEFRUI.E(  IU,  CUSH(Z) ,  (S+l/S)/2)$ 

(C5)  SECH(Z)**2*TANH(Z)**2@ 

2  2 

(05)  TANH(Z)  ♦  SECH(Z) 

(CG)  APPLY1(*,RI,R2/R3/R4)Q 


2 

1 

(S  -  -) 

4  S 

(D6)  . . . 

2  2 
1  1 

(S  ♦  -)  (S  ♦  -) 

S  S 

(C7;  RATSIMP(*)Q 

(07)  1 


2.6.  Advising  the  Simplifier 

When  the  user  of  a  system  like  MACSYHA  introduces  new  func¬ 
tions  or  uses  old  functions  in  a  way  that  is  unfamiliar  to  the 
system/  he  may  find  himself  battling  certain  '’built-in"  aspects 
of  MACSYMA. 

On  one  hand/  he  may  find  that  the  SIMPLIFY  program  does  not 
simplify  expressions  the  way  he  wants  it  to.  While  he  can  work  at 
odds  with  the  simplifier  to  some  extent  by  using  Markov-style 
algorithms  on  his  data/  the  global  and  al 1 -pervasive  influence  of 
the  simplifier  must  sometimes  be  modified.  Although  the  user 
could  just  turn  off  the  simplifier/  this  solution  is  probably  not 
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very  useful.  The  chances  are  that  he  still  wants  the  simplifier 
to  work  on  most  of  the  expression  under  consideration,  but  not  on 
some  particular  part  In  some  particular  fashion. 

On  the  other  hand,  he  may  find  that  the  SIMPLIFY  program  Is 
just  Ignorant  of  functions  of  Interest  to  him.  For  example,  a 
user  may  wish  to  see  SINH(O)  replaced  by  0  whenever  It  occurs, 
especially  If  It  occurs  inside  a  calculation.  He  nay  also  wish 
to  tell  the  simplifier  that  X**N  is  0  for  N  greater  than  some 
number  M.  This,  In  effect,  allows  one  to  truncate  while  doing 
arithmetic  on  power  series. 

For  these  reasons,  an  advising  facility,  similar  In  certain 
respects  to  Teltelman's  ADVISE  (44)  has  been  Implemented.  There 
are  two  commands  to  advise  the  simplifier:  TELLS  IMP,  and 
TELLSIMPAFTER.  They  have  the  following  forms: 

TELLS INPCpaifcfirn/  replacement) H 
TELLSIMPAFTER(jiaLUcn/  reel  acement ) H 

The  arguments  are  similar  to  those  of  DEFRULE,  but  the 
paturn  must  conform  to  certain  restrictions  described  below. 

TELLS  IMP  analyzes  the  pattern,  and  If  It  Is  either  a  sum,  a 
product,  or  an  atom  ( I • e •  a  single  variable  name  or  a  number)  it 
will  complain.  Sums  and  products  are  excluded  by  TELLSIMP  because 
of  the  interdependence  of  the  simplifier  and  the  matching  pro¬ 
grams  in  this  implementation.  TELLSIMPAFTER,  discussed  at  the 
end  of  this  section,  has  no  such  restriction. 
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The  exception  for  atomic  variables  is  necessary  because  the 
advice  is  stored  on  the  property  list  of  operators,  where 
SIMPLIFY  looks  for  It.  SIMPLIFY  does  not  look  on  the  property 
list  of  variables  for  simplification  advice.  This  restriction, 
however,  is  hardly  important,  since  setting  a  variable  to  its 
"simplified"  form  will  give  the  same  effect. 

The  simplification  of  sums  and  products  should  probably  be 
attacked  in  ways  other  than  through  TELLS  IMP  TELLS IMPAFTER. 
it  is  simple  (but  somewhat  naive)  to  suggest  that  (sin  x)**2  ♦ 
(cos  x)**2  ■■>  1  be  told  to  the  simplifier  as  TELLS  IMP 
(SIM(X)**2,l-COS(X)**2);  what  is  really  needed  is  a  facility  that 
demands  the  presence  of  both  sines  and  cosines,  and  removes  them 
in  appropriate  circumstances.  All  the  above  rule  does  is  remove 
sines  in  favor  of  cosines,  sometimes. 

TELLSIMPAFTER(SIN(X)**2*COS(X)**2, 1),  although  a  legal  command, 
does  far  less  that  the  user  may  think.  For  example,  it  leaves  out 
the  possibility  of  a  third  term  In  the  sum  (e.g., 
5+sln(y)**2+cos(y)**2),  It  does  not  back  up  (e.g., 
sin(y)**2+cos(2*y)**2+sin(2*y)**2)  and  It  does  not  detect 
instances  of  the  pattern  implicit  in  such  constructions  as 
sln(y)**4+2*sln(y)**2*cos(y)**2+cos(y)**4.  While  patterns  may  be 
constructed  for  some  of  these  expressions.  It  Is  our  opinion  that 
such  substitutions  as  sin(x)**2+cos(x)**2  ■■>  1  require  much 
stronger  methods  than  pattern  matching.  Methods  for  doing  such 
simplifications  effectively  are  available  In  the  rational 
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substitution  facility  of  MACSYMA  described  In  chapter  3.  In  it 
the  approach  used  by  REDUCE  to  handle  products  (17,  p.  8),  is 
Implemented,  but  is  extended  to  deal  with  sums  also. 

TELLSIMP  plies  new  advice  on  top  of  old  advice,  but  old 
advice  Is  still  accessible  If  the  new  advice  is  not  appropriate 
(I.e.  the  pattern  falls).  This  Is  exhibited  in  the  following 
example. 


(Cl)  COi>( PI  )Q 

(Dl)  COS(PI) 

( C 2 )  TELLSIMP(COS( PI ),-l)@ 

IS  THE  REPLACEMENT 

(D2)  COS 

(C3)  COS ( P I ) (j 

(D3)  -  1 

(  C4)  COS( -PI ) Q 

(04)  C0S(  -  Pi) 

( C5)  MPRED(X) :  «l  F  (SI GNUM(  X)«-1)THEN  TRUE  ELSE  FAf.SE$ 
( C6)  DECLARE(M,MPRED)$ 

(C7)  TELLS IMP(C0S(M),C0S(-M))$ 

(C8)  C0S(-PI)Q 
(08)  -  1 

( C9)  C0S(5*PI)lj 

(09)  COSO  PI) 

(CIO)  DECLARED,  INTEGER)* 

(Cll)  TELLSIMP(COS(N*PI),  (-1)**N)$ 

(C12)  C0S( 5* P I ) (J 

(D12)  -  1 


(C13)  COS ( — 6 ) (J 
(D13) 


COSO) 
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The  dialogue  above  shows  (Dl)  that  the  simplifier  (at  that 
time)  did  not  know  the  rules  about  pi  (-3.1415+) .  If  we  tell  It 
that  the  cosine  of  pi  Is  -1,  It  can  (D3)  simplify  COS(Pl)  to  -1. 
Line  (D4)  demonstrates  that  the  simplifier  did  not  know  about 
cosine  being  symmetric  about  0.  Lines  (C5)-(C7)  add  this  bit  of 
information,  as  evidenced  by  line  (D8).  Line  (Cll),  which  makes 
superfluous  the  advice  of  (C2),  but  not  of  (C7),  adds  the  capa¬ 
bilities  shown  in  (D12).  (C13)  shows  that  the  old  advice  is  still 
accessible. 

One  of  these  rules  happens  to  coincide  with  a  "built-in" 
simplification  COS(O)  -  1,  since  N* PI  for  N-0  matches  0; 
however,  since  the  answer  will  be  (-1)**0,  the  ordinary  operation 
of  the  simplifier  underneath  will  not  be  affected.  (System- 
defined  simplifications  will  be  tried,  but  only  if  none  of  the 
advice  is  applicable.  Note  that  if  any  of  the  advice  is 
applicable,  the  replacement  part  of  the  advice  will  have  already 
triggered  a  further  slmpl ication,  if  such  is  possible.) 

TELLS IMPAFTER  is  similar  to  TELLSIMP  except  that  new  rules 
are  placed  after  old  rules  and  "built-in"  simplifications. 

Because  of  this,  TELLS IMPAFTER  cannot  be  used  to  drastically 
alter  the  action  of  the  slmpl Ifer,  whose  "built-in"  simplifica¬ 
tions  take  precedence.  On  the  other  hand,  these  restrictions  make 
it  possible  to  apply  TELLS IMPAFTER  to  sums  and  products. 

TELLS IMPAFTER  should  be  used  on  "built-in"  operators 
whenever  possible,  since  such  rules  will  be  applied  only  if  the 
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same  operator  is  still  the  lead  operator  after  the  previous  sim¬ 
plification  has  been  performed,  if  the  lead  operator  has  been 

changed,  all  "after"  rules  are  bypassed,  producing  faster  opera¬ 
tion. 

2.7.  Non-Commutatlve  Multiplication 

At  this  time,  a  standard  non-commutative  multiplication 
simplification  program  is  not  generally  included  in  MACSYMA. 

There  are  several  different  programs  available,  but  it  may  be  the 
case  that  none  of  them  does  exactly  what  is  required  in  a  given 
problem  area.  This  section  describes  how  one  might  add  a  fairly 
extensive  hand-tailored  facility  by  using  the  TELLS  IMP  commands. 
The  group  operation,  represented  by  a  period  (.),  is  allowed  by 
the  parser  in  anticipation  of  the  time  when  an  efficient  non- 
commutati ve  multiplication  scheme  is  programmed  in  LISP.  (Since 
the  same  symbol  is  used  to  denote  the  decimal  point  of  a  floating 
point  number,  extra  parentheses  may  sometimes  be  required  to 
avoid  misinterpretation.) 

Telling  the  simplifier  about  non-commutative  multiplication 
requires  a  bit  of  knowledge  of  the  internal  representation.  The 
input  A.B  is  parsed  to  ( (MCTiMES )  $A  $B),  that  is,  a  prefix 
representat ion  (although  with  certain  peculiarities  of  no 
importance  to  this  discussion).  The  fact  that  MCTiMES  is  a  binary 
operator  rather  than  a  "varl-ary"  operator  will  complicate  mat¬ 
ters  somewhat.  We  will  abbreviate  ((MCTIMES)  $A  $B)  as  (.  A  B). 
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The  Input  A.B.C  or  (A.B).C  is  parsed  to  (.  (.  A  B)  C),  but 

a.(B.C)  is  parsed  to  (.  A( .  B  C)).  Clearly  one  of  the  first  jobs 

of  the  "MCTIMES"  simplifier  is  to  transform  the  second  structure 

into  the  first.  To  do  this  (in  effect,  telling  the  simpltfier 

about  the  associative  law),  we 

DECLARE( A,TRUE) $ 

DECLARER, TRUE)$ 

DECLARE(C,TRUE) $ 

TELLSIMP(A.(B.C),(A.B).C)$ 

As  an  example  of  how  this  operates,  consider  (A.B).(C.D). 
This  is  parsed  to  (.  (.  A  B)(.  C  D))  which  is  then  simplified  to 

(.(.(.  A  B)  C)  D) .  Since  the  simplifier  is  recursive,  any  depth 
of  forced  nesting  is  untangled. 

Any  time  two  identical  elements  are  adjacent,  we  want  to 
combine  them.  That  is,  A. A  »  A  ;  more  generally,  A  .A  *  A  • 
Since  our  pattern  matcher  is  clever  enough  to  recognize  A  as  an 
occurrence  of  A  ,  this  one  pattern  would  suffice,  but  for  one 
difficulty:  although  A. A  is  parsed  to  ( .  A  A),  B.A.A  is  parsed  to 
(.  (.  B  A)  A).  These  two  situations  differ  sufficiently  with 
respect  to  adjacency  of  the  A's  so  as  to  require  the  two  patterns 
below. 

DECLARED,  TRUE)  $ 

DECLARE(M,TRUE)$ 

TELLS IMP( ( A**H) . ( A**N),A**(M+N) )$ 

TELLS IMP(B. ( A**M) . (A**N),B.A**(M+N) )$ 

Let  us  denote  the  inverse  of  A  by  INV(A),  and  the  Identity 
by  1.  We  might  then  have 
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TELLS IMP( iNV( 1) ,  1) $ 

TELLS IMP( i NV ( i NV( A) ) # A) $ 

TELLSiMP(INV(A.B), INV(B). INV(A))$ 

Recall  that  these  pieces  of  advice  are  placed  on  the  pro¬ 
perty  list  of  the  function  INV#  and  so  are  independent  of  the 
previous  bits  of  advice#  which  are  on  the  property  list  of 

Another  piece  of  advice  which  will  be  needed  goes  on  the 
property  list  of  "**"  —  this  time#  after  other  simplifications 
have  been  made: 

TELLS iMPAFTER( I  NV(A)**N# 1 NV(A**N) )$ 

The  major  fact  concerning  inverses  is  their  "cancellation" 

property.  That  is#  A.INV(A)  -  INV(A).A  -  1.  To  automate  this#  let 

,  .  n  m  j  k 

us  consider  the  more  general  situation,  (A  ).INV(A  )  -  A  *iNV(A  ) 

where  at  least  one  of  j  or  k  is  0. 

Let  us  define  M0NUS(N#M)#  which  will  compute  j  and  k: 

M0NUS(N#M):»  IF  N>M  THEN  N-M  ELSE  0$ 

and  I NVPR0G(A# N#M)  which  will  compute  the  right  hand  side  of  the 
above  reduction  formula. 

I NVPR0U( A# N#M) : »  A**M0NUS (N#M) * i NV( A**M0NUS(M# N) ) $ 

Thus: 

TELLS  iMP(  (  A**N) .  I NV( A**M)# INVPR0G(A,N,M))$ 

TELLS IMP( I NV( A**M) . ( A**N) , I NVPR0G( A#  N#M) ) $ 

TELLSIMP(U. (A**N) . 1NV(A**M)#B. INVPR0G( A# N,M) ) $ 

TELLS IMP(B. i NV( A**M) . ( A**N) #  b. I NVPR0G( A#  N,M) ) $ 

Finally# 

0ECLARE(N# INTEGER)$ 

TELLS IMP(N. A#N*A)$ 

TELLSiMP(A.N#N*A)$ 
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gives  us  such  useful  notions  as  left  and  right  zeros,  identities, 

and  nultipl ication  by  scalars.  It  nay  appear  that  we  have  left 

out  some  items,  for  example, 

TELLSIMP(A**0,1)$ 

TELLSIMP( I NV( A) **0, 1)$ 

TELLSIMP(1.A,A)$ 

but  this  is  not  so.  Since  l.A  will  be  converted  to  1*A,  which 
will  be  simplified  to  A,  the  last  rule  Is  unnecessary.  Since  A**0 
will  (unless  we  tell  the  simplifier  otherwise)  always  result  in 
1,  the  other  two  arc  also  unneeded. 

As  examples  of  how  this  new  simplifier  operates, 

X. IUV(X)**2  is  simplified  to  iNV(X),  and  A.B.(B**3).C. INV(C)  is 
simplified  to  A.B**4.  This  last  example  used  about  .7  seconds  of 
machine  time  when  the  simplification  rules  were  in  uncompiled 
LISP  (on  a  PUP-10  computer  using  2.75  microsecond  cycle  time 
memory),  and  when  compiled  by  the  LiSP  compiler,  about  .05  sec. 

2.8.  Comparisons  with  SCHATCHEN,  FAMOUS, 

REDUCE,  Formula  Algol 

SCHATCHEN  (35),  Hoses'  matching  program  is  similar  to  our 
matching  program  in  many  respects.  However,  there  are  significant 
differences,  both  in  implementation  and  in  philosophy,  between 
the  two  systems. 

SCHATCHEN  demands  patterns  in  a  form  resembling  the 
internal  form  for  expressions,  it  uses  controls  (called  nodes)  on 
the  pattern  match  to  direct  its  highly  recursive  matching  pro¬ 
cesses.  Our  "straight-line"  matching  programs  preserve  some,  but 
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not  all,  of  the  aspects  of  the  mode  facility. 

A  SCHATCHEN  pattern  corresponding  to  the  Intuitive  notion 
of  "quadratic  in  x"  discussed  In  section  4  Is: 


(QUOTE 
(PLUS 
( COEFFPT 
(A 

(FUNCTION 
(LAMBDA  (Y) 


( EXPT  X 
(COEFFPT 


2)) 


(AND  (FREE  Y  (QUOTE  X)) 

(NOT  (EQUAL  Y  0)))))) 


(B  (FUNCTION  (LAMBDA  (Y) 

(FREE  Y  (QUOTE  X))))) 

(COEFFP 

(C 


(FUNCTION  (LAMBDA  (Y) 

(FREE  Y 


(QUOTE  X) ))))))) 


This  Is  not  in  the  best  possible  form  for  SCHATCHEN,  but  It 
serves  to  illustrate  several  points.  First,  the  pattern  is 
written  as  a  LISP  S-expression  which,  upon  close  examination,  has 
most  of  the  components  of  a  prefix  representation  of  the 

2 

algebraic  expression  AX  +BX+C.  Second,  there  are  a  number  of 
extra  notations  In  the  pattern,  some  of  which  clearly  depend  on 
LISP's  version  of  the  lambda-calculus.  A  less  obvious  point  Is 
that  the  pattern  implies  an  ordering  on  the  subtasks  required  to 
match  it  to  an  expression. 

There  are  two  modes,  COEFFPT  and  COEFFP,  used  in  this 
pattern.  They  stand  for  "coefficient  in  plus  and  times"  and 
"coefficient  in  plus"  respectively,  and  their  uses  are  best 


55 


described  through  an  example. 

2  2 

Consider  the  quadratic,  Q  *  2X  ♦  YX  ♦  3  ♦  Z.  There  are 
2  2 

two  terms  involving  X  .  For  the  pattern  AX  +  BX  ♦  C  to  match  Q, 
A  must  match  2  +  Y.  This  is  indicated  to  SCHATCHEN  by  using  the 
indicator  COEFFPT.  This  modifies  the  action  taken  to  match  A  by 

causing  SCHATCHEN  to  traverse  Q  looking  for  coefficients  of  X* 
and  assigning  to  A  the  simplified  sum  of  those  coefficients. 
Similarly,  by  matching  B  with  mode  COEFFPT,  B  is  assigned  the 
simplified  sum  of  the  coefficients  of  X  (or  is  assigned  zero  if 
there  are  no  coefficients,  as  is  the  case  for  Q) . 

SCHATCHEN  requires  that  C  in  the  quadratic  pattern  be 
matched  using  the  mode  COEFFP  (that  Is,  "coefficient  In  plus")  so 
that  in  Q,  C  will  match  Z  ♦  3,  and  not  just  one  term  (e.g.  Z  or 
2 

3).  Since  AX  and  BX  have  been  previously  deleted  from  the  ex¬ 
pression  by  the  matching  procedure,  C  (by  virtue  of  its  being 

indicated  a  COEFFP)  will  match  what  is  left  in  the  sum,  namely  Z 
+  3. 

SCHATCHEN  also  provides  opportunities  to  apply  predicates 
to  A,  B,  and  C;  in  this  case  they  each  are  checked  to  make  sure 
they  are  free  of  X.  A  is  also  checked  to  assure  it  is  nonzero. 

Compared  to  the  relatively  casual  definition  of  QUADRATIC 
in  section  4,  using  these  controls  requires  a  high  level  of 
awareness  on  the  part  of  the  user,  both  of  the  representation  of 
data,  and  the  operation  of  SCHATCHEN.  This  burden  of  awareness  is 
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considerable.  However,  SCHATCHEN  matches  differ  from  the  matches 
done  here  in  a  more  fundamental  sense.  We  find  a  particular 
subexpression  and  apply  a  predicate,  if  the  predicate  fails,  the 
match  fails.  In  a  similar  situation,  SCHATCHEN  will  try  to  find 
another  subexpression  which  matches  the  subpattern,  which  might 
satisfy  the  predicate.  The  match  falls  only  if  this  exhaustive 
search  fails  to  find  any  subexpression  matching  (and  satisfying) 
the  subpattern. 

This  difference,  which  would  seem  to  indicate  that 
i>  CHAT  CHEN  is  more  powerful,  is  somewhat  deceptive.  We  use  more 
powerful  tools  to  find  an  appropriate  place  to  apply  a  predicate, 
and  then  apply  it  only  once.  (The  coefficient-finding  routine  we 

,  2 
use  can  find  that  the  coefficient  in  (2x)(3x+l)  of  x  is  6; 

SCHATCHEN  would  fail  to  notice  this.)  There  is  an  increase  in  ef¬ 
ficiency  since  the  programs  produced  by  the  match  compiler  are 
"straight-line"  code,  and  apply  predicates  (assuming  success) 
only  as  many  times  as  there  are  distinct  variables  in  the  pat¬ 
tern.  In  case  the  pattern  fails,  fewer  predicates  are  applied. 

The  number  of  times  SCHATCHEN  applies  its  predicates  is  much  more 
dependent  on  the  expression.  While  SCHATCHEN  has  certain  types  of 
iterative  facilities  within  a  single  pattern,  the  programming 
language  facility  in  MACSYMA  can  supply  some  of  the  same 
iterative  machinery,  as  in  section  5. 

There  are  some  instances  where  SCHATCHEN  is  undeniably  more 
thorough  (within  the  scope  of  a  single  pattern):  if  the  pattern 
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B 

is  A  and  the  expression  Is  1,  either  B  matching  0  ql  (B's 
predicate  failing)  A  matching  1  will  cause  the  pattern  to 
succeed.  We  insist  that  A  match  1  and  B  match  0. 

TELLS  IMP  gives  essentially  all  the  power  of  FAMOUS  for 
flexibly  altering  an  algebraic  simplifier/  yet  allows  one  to  have 
a  quite  competent  "fall-back"  facility.  While  using  TELLS  IMP 
excessively  on  commonly  used  operators  might  make  the  system  run 
as  slowly  as  did  FAMOUS/  It  is  unlikely  that  that  point  will  be 
reached  either  frequently  or  quickly.  Using  TELLSIMP  on  new  func¬ 
tions  (e.g.  SINH)  does  not  affect  the  speed  of  the  simplifier  on 
old  functions.  The  technique  of  compiling  rules  achieves  a  modest 
level  of  efficiency;  using  the  LISP  compiler  further  speeds  up 
processing.  Of  course/  advice  requiring  much  computation  (e.g., 
replace  INV(A)  where  A  is  a  square  matrix,  by  its  computed 
inverse)  will  slow  up  the  simplifier  in  direct  proportion  to  the 
length  of  the  computation/  and  how  often  it  is  done.  Easy  advice, 
in  this  user's  experience,  has  not  caused  a  noticeable  change  in 
system  response.  More  precise  measurements  can  be  made,  of 
course,  but  very  little  unnecessary  system  degradation  is 
introduced  by  the  particular  techniques  used.  (Some  timing  data 
appeared  at  tl.o  end  of  section  7).  Furthermore,  the  TELLS IMPAFTER 
facility,  potentially  far  more  efficient  than  a  last-in  first-out 
rule  organization,  is  available. 

It  is  clear  that  flexible  pattern  matching  results  in  an 
enormous  decrease  in  the  number  of  rules  required  to  achieve  a 
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given  match.  Consider  the  rules  that  would  be  required  to  define 
"quadratic  in  x"  in  a  purely  syntactic  manner#  as  in  FAMOUS  or 
Formula  Algol : 


x**2 

x**2  +  x 
x**2  ♦  b*x 
x**2  +  c 
x**2  ♦  x  ♦  c 
x**2  ♦  b*x  ♦  c 


a*x**2 
a*x**2  ♦  x 
a*x**2  ♦  b*x 
a*x**2  ♦  c 
a*x**2  ♦  x  ♦  c 
a*x**2  ♦  b*x  ♦  c 


This  also  assumes 

(1)  ♦  and  *  are  commutative  with  respect  to  the  match; 

(2)  a#  b#  and  c  may  be  declared  free  of  x; 

(3)  a#  b#  and  c  may  each  match  more  than  one  term; 
and  (4)  the  minus  sign  is  not  a  separate  operator. 

This  Is  not  meant  to  imply#  however#  that  restricted  styles 
of  matching  are  never  appropriate.  By  using  restricted  matches# 
Fenichel  was  able  to  justify  his  contention  that  arbitrary  and 
precisely  specified  algorithms  could  be  constructed  in  FAMOUS. 
Itturiaga  (23)  used  similar  techniques  in  Formula  Algol  to  pro¬ 
duce  somewhat  more  practical  results#  but  the  syntactic  (rather 
than  semantic)  nature  of  Formula  Algol  pattern  matching  prevented 
the  tackling  of  difficult  problems  in  a  natural  fashion.  FAMOUS 
and  Formula  Aigol  insist  that  expressions  look  very  nearly  like 
the  pattern  which  is  used  to  match  against  them.  Fenichel' s 
super-match  proposal#  implemented  in  (32)#  changes  each  single 
pattern  into  a  large  number  of  similar  patterns  by  trans¬ 
formations  of  commutative  operators  (etc.).  This  is  scarcely  an 
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Improvement  In  efficiency/  and  appears  to  be  useful  only  as  a 
shorthand  In  writing  out  long  rule  sets.  By  contrast/  our 
semantic  approach  can  match  quadratics  which  do  not  resemble  an* 
of  the  above  twelve  forms. 

Dependence  on  local  syntactic  transformations/  another 
major  thread  In  FAMOUS,  has  serious  Implications  relative  to  ef¬ 
ficiency.  For  example,  the  ad  hac  treatment  of  "logsum"  ((16) 

if* 

page  42)  was  necessary  because  local  Information,  In  some  cases, 
has  to  be  propagated  outside  of  Its  Immediate  vicinity.  (The 
logsum  device  separated  sums  Into  logarithmic  terms  and  non- 
logarlthmlc  terms.  If  the  sum  occurred  In  an  exponent,  the  log 


.  ...  .  „  (x+ log( y) )  x 

term  became  a  coefficient  of  the  base.  Thus  e  =*>y  e 

If  the  sum  was  not  In  an  exponent,  a  great  deal  of  time 

has  been  wasted.)  Waste  of  this  sort  Is  avoided  by  MACSYMA  (and 


no  doubt  In  other  algebraic  manipulation  systems  not  tied  down  to 
local  syntactic  transformations)  by  considering  such  analyses  in 
a  top-down  fashion.  This  provides  sufficient  global  context  to 
distinguish  sums  occurring  in  exponents  from  sums  occurring 
outside  exponents. 


To  the  concept  of  spatial  or  syntactic  adjacency  must  be 
added  the  concept  of  adjacency  along  semantic  dimensions.  For 
example,  if  the  properties  of  an  exponent  are  adjacent  to  its 
base,  then  an  efficient  local  "logsum"  device  might  be 
constructed.  In  the  expression  f  +  g  +  h,  it  is  clear  that  f  and 
h  should  be  considered  just  as  adjacent  as  f  and  g.  What  is  less 
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clear  is  how  one  might  note  that  f  and  g,  being  integer-valued 
functions,  make  them  adjacent  along  a  semantic  dimension. 

MACsYMA  allows  information  to  be  stored  at  operator  nodes 
in  the  internal  tree  representation  of  expressions  (e.g.  "this 
expression  and  all  its  subexpressions  are  simplified")  which  has 
some  aspects  of  this  semantic  dimension.  This  "property  list"  of 
operators  has  turned  out  to  be  an  extremely  useful  design 
decision,  one  with  applications  to  many  difficult  implementation 
problems.  The  types  of  information  stored  on  these  nodes  will  no 
douoc  Decoine  more  varied  as  MACSYMA  continues  to  grow. 

Another  thread  in  FAMOUS  Is  reliance  on  the  Markov 
algorithm  formalism.  It  is  clear  that  some  algorithms,  (e.g, 
synthetic  division  of  polynomials)  are  difficult  to  program  in 
such  a  formalism.  These  algorithms  benefit  not  only  from  a 
different  style  of  program  organization,  but  also  from  a 
radically  different  data  representation.  Fenichel,  by  not 
modeling  any  sophisticated  polynomial  manipulation  capabilities, 
implicitly  recognized  this  limitation. 

In  summary,  FAMOUS  and  Formula  Algol  cannot  compete  with 
MACSYMA  with  regard  to  efficiency  or  ease  of  use  in  algebraic 
manipulation  on  several  grounds: 

(1)  the  lack  of  a  competent  base  simplifier  (FAMOUS  assumes 
nothing  about  the  characteristics  of  its  data,  and  cannot  assume, 
therefore,  that  any  particular  simplifications  would  always  be 
valid;  Formula  Algol  nas  only  trivial  built-in  simplifications.), 

(2)  the  inflexibility  of  the  rules  (a  consequence  of  their 
syntactic,  rather  than  semantic,  nature), 

(3)  inefficient  rule-sequencing  techniques  (they  have  no 
equivalent  to  TELLS IMPAFTER) . 
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FAMOUS  has  additional  problems  because  of: 

(4)  its  requirement  that  *"he  Markov  algorithm  formalism,  and 
data  types  appropriate  to  it  bfc  used  for  all  manipulations, 

(5)  the  absence  of  facilities  for  global  communication. 

REDUCE  has,  in  addition  to  objection  (2)  above,  another 
problem.  It  considers  the  user-supplied  rules  only  after  it  has 
done  Its  own  simplifications.  Therefore  a  rule  X**l  **>  o  for  all 
I  will  not  prevent  X**0  *«>  1,  the  action  taken  by  the  simpli¬ 
fier.  Furthermore,  REDUCE  does  not  allow  sums  in  rules  at  the 
top  level.  KEUuCt,  although  probably  more  efficient  within  its 
domain  (19),  would  require  considerable  programming  to  extend  It 
to  the  realm  of  non-rational  functions,  a  domain  treated 
routinely  here. 

Finally,  It  is  not  certain  that  a  closer  model  of 
SCHATCHEN,  including  back-up,  but  (of  necessity)  closely  tied  to 
the  internal  representation,  would  greatly  aid  a  user  (except 
perhaps  a  system  programmer),  considering  the  burden  it  would 
impose.  The  benefits  of  our  implementation  are  clear:  we  give  a 
user  error  and  warning  messages,  the  selector  facility,  and  easy- 
to-use  methods  for  declaring  variables  and  defining  patterns.  For 
the  most  part,  he  can  remain  Ignorant  of  the  subtleties  of  LiSP 
and  the  data  representation  (a  sharp  contrast  with  SCHATCHEN), 
and  yet  define  powerful,  flexible  patterns. 
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2.9.  Differential  Equations 
The  following  example  of  a  dialogue  with  MACSYMA 
illustrates  the  usefulness  of  pattern  matching  in  constructing 
more  useful  programs.  We  wish  to  program  the  solution  of  ordinary 
linear  first-order  differential  equations,  i.e. 

DY 

F(X)  (— )  ♦  G(X)*Y  +  H(  X )  *  0 
UA 


where  F,  G,  and  H  are  functions  of  X,  but  not  of  Y.  The  solution 
can  be  written  in  terms  of  integral s,  as  demonstrated  by  the  pro 
gram  defined  on  line  C6,  below.  (Details  of  the  programming 
syntax  are  descrioea  in  Appendix  I  co  this  thesis.)  Note  that 
is  correct,  although  in  a  somewhat  unusual  form. 


(Cl)  DECLARE( F, NONZEROANUFREEOF ( Y) ) $ 
( C 2 )  DECLARE(G,FREEOF(Y))$ 

(C5)  DECLARE(H/FREEOF(Y) )$ 

( C4)  P  :  F*DERI VAT  I VE(Y,X)+G*Y+HS 
(  C5)  DEFMATCH(  PAT,  P,  Y,X)i«j 

DY 

F  (— )  ♦  G  Y  ♦  H 
DX 


IS  THE  PATTERN 

(D5)  PAT 


(Cb)  LI NDEP( EQ, Y, X )  : *B L0CK( [F, G, H, P, U,SOL 
IF  PAT( EQ, Y, X)=FALSE  THEN  FaLSE 


ELSE 

P  :  %E**(  I NTEGRATE( G/F, X) ) , 

Q  :  H/F, 

SOL:.Y*P+INTEGRATE(Q*P,X), 

EXPAND(SOLVE(SOL=CONST,Y)))s 


/ 
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<C7)  DERIVATIVE(Y,X)  +  3*Y+1ms 

DY 

(D7)  —  ♦  3  Y  ♦  4 

OX 

(C8)  LINDEPU,Y,X)$ 


(08) 


CONST 
Y . . 


3  X 
IE 


4 

3 


The  program  on  1 Ine  C6  could  easily  be  altered  to  account 
for  other  types  of  equations.  If  the  PAT  pattern  falls,  other 
patterns  could  De  tried,  each  with  Its  own  method  of  solution,  if 

none  of  the  patterns  succeed,  other  analytic  or  numerical  methods 
could  be  tried. 


2.10.  Other  Applications 

One  of  the  major  problems  of  algebraic  manipulation  systems 
has  been  the  lack  of  substantial  tools  to  aid  in  human 
comprehension  of  large  expressions.  Hearn,  in  (20),  explores  this 
problem.  He  displays  an  expression  with  a  large  number  of 
dependent  variables,  and  by  properly  choosing  substitutions  of 
expressions  for  variables,  produces  a  new  expression  reduced  in 
size  and  complexity.  This  requires  a  high  degree  of  human 
experimentation  and  interaction  with  the  computer.  In  chapter  3 
we  describe  more  sophisticated  substitution  methods  which  relieve 
the  user  of  some  of  his  headaches,  but  still  require  explicit 
"substitute  A  for  B"  type  commands.  By  contrast,  the  Markov 
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algorithm  processing  of  expressions/  combined  with  semantic  pat- 
tern  matching/  can  lead  to  more  genera)  styles  of  substitution: 
e.g.  For  any  Z,  substitute  Y(Z)  for  CO$(w*t+Z). 

Another  approach  toward  improving  comprehension  has  been 
the  automatic  "breaking-up"  of  expressions  at  (computer-chosen) 
positions.  The  parts  are  then  easier  to  display  (30)/  or 
manipulate  further  (10)/  (20)/  (34).  Unfortunately/  except  for 
special  cases,  the  computer-chosen  break  points  tend  to  obscure 
the  underlying  structure.  By  breaking  an  expression  up  at  points 
suggested  by  user-supplied  patterns,  and  renaming  the  pieces  (say 
by  allocating  coefficients  of  certain  types  and  locations  to  a 
matrix),  inherently  bulky  expressions  can  be  reduced  to  more 
craccaDle  sizes,  ms  a  simple  example,  the  pattern  A  +  B*%i,  for 
a  and  B  declared  free  of  %\  serves  to  separate  real  and  imaginary 
parts  of  an  expression. 

2.11.  Conclusions 

Although  a  pattern-directed  interpreter  (along  the  lines  of 
SCHATCHEN  or  FAMOUS)  could  have  been  written  to  implement  this 
algorithm,  a  compiler,  which  produces  a  LISP  program  from  the 
pattern,  was  written  instead.  There  are  several  advantages  to 
this  approach; 

1,  Elaborate  checking  is  done  at  compile-time,  to  help  insure 
that  patterns  make  sense.  An  interpreter  can  provide  this  only 
at  considerable  cost  at  execution  time.  This  makes  interpreta¬ 
tion  unattractive  to  a  user  who  needs  as  much  error-checking 
as  possible. 
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2.  When  the  match  compiler  is  no  longer  needed/  it  can  be  removed 
from  core  memory/  and  the  space  it  occupies  reclaimed.  Only 
the  pattern  programs  themselves  are  required  at  execution 
time.  An  interpreter  must  be  present  any  time  a  pattern  is 
matched.  It  is  possible  that  a  large  number  of  pattern 
programs  could  collectively  take  more  space  than  some  other 
pattern  represent  ion,  so  that  this  advantage  is  not  clear  cut. 
However/  judging  from  the  size  of  the  match  compiler/  we 
suspect  chat  an  interpreter  performing  the  same  tasks  is 
likely  to  be  sufficiently  large  so  as  to  be  more  space 
consuming  than  perhaps  40  pattern  programs. 

3.  With  the  exception  of  calls  to  the  simplifier/  the  coefficient 
routines/  and  calls  to  subroutines  to  find  exponents/  bases/ 
and  unknown  functions/  the  program  produced  by  the  DEFMaTCH 
(or  DEFRULE/  TELLS  IMP/  etc.)  command  is  self-contained.  The 
application  of  predicates/  the  assignment  of  values/  and 
sequencing  of  operations  is  rapid  and  efficient.  Furthermore/ 
each  pattern  program  can  be  compiled  into  machine  language  by 

a  LISP  compiler/  which  (on  the  PGP-10)  decreases  the  bulk  of 
the  program  and  may  increase  the  speed  by  a  factor  of  ten.  It 
may  appear  that  this  possibility  is  independent  of  the 
question  of  compilation  </s.  interpretation/  since  the  pattern- 
directed  interpreter  could  also  be  compiled  into  machine  code. 
This  is  not  the  point  we  are  making.  The  patterns  for  the 
interpreter  cannot  be  compiled  since  they  are/  of  necessity/ 
LISP  data.  On  the  other  hand/  the  pattern  programs  of  our 
system  can  be  compiled  completely  into  machine  code. 
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The  advantages  of  semantic  (as  opposed  to  syntactic) 
matching  are  clear.  Semantic  matching  as  Implemented  in  MACSYMA 
allows  the  user  to  introduce  new  Information  relying  on  a  wide 
range  of  previously  developed  Information  and  simplification 
rules.  Syntactic  methods  would  require  considerably  more 
efforts  (since  all  information  would  have  to  be  encoded  in  syntax 
only)  and  result  in  a  less  powerful  extension. 

Chapter  2  -  Appendix  i 

Detailed  description  of  the  MATCH  processor. 

Up  to  this  point  we  have  tried  to  show  mainly  by  examples, 
what  kinds  of  patterns  can  be  compiled.  By  describing  the 
algorithm  used  to  compile  patterns  into  programs,  this  appendix 
explicates  the  nature  of  the  semantic  matching  done  by  the 
resulting  programs.  Some  details  which  are  concerned  only  with 
"code  opt imi zat ion"  are  omitted  —  as  an  example,  the  predicate 
"TRUE"  is  never  actually  called,  since  the  result  is  known  to  the 
match  compiler.  However,  the  operatic?;  would  be  unaffected  if  a 
call  to  "TRUE"  were  actually  used. 

D6.f.in LLLgn:  An  unmatched  var  1  able  in  a  pattern  is  a  variable 
which  is  declared  and  for  which  no  value  has  yet  been  assigned 
during  this  matching  process.  A  variable  may  be  assigned  a  value 
either  by  being  in  the  list  of  patternvar 1 sf  or  by  being 
successfully  compared  to  an  expression,  a  pattern  £  i s  compared 
12  an  expression  s.  by  attempting  a  match  between  £  and  £.  if  the 
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match  succeeds,  all  unmatched  variables  in  £  will  be  assigned 

values.  If  the  match  falls,  the  value  FALSE  is 

returned. 

D.fif-i Rilifln:  If  a  pattern  £  has  no  unmatched  variables  In  It,  it 
is  called  a  £1  fifed  £fl££fe£n,  or  Is  said  to  be  fixed. 

Kemarki  Any  number  is  a  fixed  pattern.  Any  undeclared  "atomic" 
name  Is  a  fixed  pattern.  A  sum,  product,  (etc.)  of  fixed  patterns 
Is  a  fixed  pattern. 

UfefJjll  t  Lon:  A  pattern  Is  anchored  If  after  all  fixed  parts  have 
been  subtracted,  divided  out,  or  otherwise  removed  from  an 
expression  instance  of  the  pattern 

(1)  The  remaining  pattern  consists  of  an  isolated  unmatched 
variable  not  in  a  sum  or  product. 

or  (2)  There  is  at  least  one  fixed  subpart  of  the  pattern 
such  that  any  expression  Instance  may  be  separated  Into  at  least 
two  parts,  each  part,  furthermore,  corresponding  to  an  anchored 
sub-pattern  of  the  original  pattern. 

The  pattern  compiler  In  MACSYMA  seeks  out  anchors,  and 
successively  compiles  program  segments  to  remove  those  parts 
which  can  be  unequivocally  identified,  if  the  remaining  parts 
provide  no  anchor,  or  If  several  not  distinct  anchors  are 
provided,  the  compiler  will  not  be  able  to  take  advantage  of  Its 
oullt-in  knowledge,  in  some  cases,  warning  messages  will  be 
produced. 
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One  of  the  basic  design  decisions  concerning  the  internal 
format  of  MACSYMA  expressions  pervades  this  algorithm.  MACSYMA 
removes  inessential  operators  such  as  division  and  negation:  A/tt 
is  represented  internally  by  A*B**(-1),  and  -A  is  represented  by 
(-1)*A.  Reducing  all  arithmetic  operators  to  +  ,  *,  and  **  has  the 
disadvantage  of  causing  a  moderate  increase  in  the  size  of 
internal  representat ions,  but  has  the  overriding  advantage  of 
erasing  small  differences  in  appearance  which  might  tend  to 
complicate  the  matching  process.  (The  MACSYMA  Input  and  output 
rout i nes,  in  order  to  Improve  readability,  reintroduce  quotients, 
differences,  and  unary  minuses.)  Markov  algorithms  written  in 
Formula  Algol  seem  to  be  largely  concerned  with  juggling  these 
redundant  internal  notations,  a  confirmation  of  the  suitability 
of  our  design  decision,  (see  (23)  pp.  172-174) 

The  remainder  of  this  appendix  describes  in  detail  the 
methods  used  to  seek  out  anchors.  These  methods  vary  depending 
on  the  context,  so  that  an  anchor  within  a  sum  is  different  from 
an  anchor  within  a  product.  Although  we  have  tried  to  make  this 
description  as  clear  as  possible,  it  is  not  our  intention  that  a 
user  of  MACSYMA  read  this  as  a  prerequisite  to  using  the  pattern 
matching  system.  A  user  should  compose  patterns  in  the  inter¬ 
active  MACSYMA  environment,  and  by  viewing  the  explicit  actions 
of  the  patterns  themselves,  he  should  judge  their  suitability. 

This  is  similar  to  the  philosophy  of  other  parts  of  MACSYMA:  a 
user  will  rarely  know  £  Priori  whether  or  not  an  integration  can 
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be  performed  by  the  system,  or  whether  an  indicated  command  is 
powerful  enough  to  accomplish  his  task.  Although  it  Is  desirable 
to  describe  capabilities  in  a  clear  manner,  it  is  unreasonable  to 
restrict  the  capabilities  to  that  which  can  be  so  described. 

With  these  preliminaries,  we  can  define  precisely  what  is 
meant  when  a  pattern  £  matches  an  expression  £. 

I.  If  a  pattern  £  is  fixed,  then  it  matches  an  expression  £  if 
and  only  i f  £  -  £,  when  simplified,  is  0.  Of  the  simplification 
routines  in  MACSYMA,  the  general  ("advisable")  one  is  usually 
used.  When  coefficients  have  been  picked  out  of  an  expression  in 
the  previous  step,  canonical  rational  simplification,  which 
expands  expressions  and  combines  similar  terns,  is  used.  Note 
the  heavy  dependence  on  the  power  of  the  simplifier.  If  the  user 
has  (presumably  by  mistake)  told  the  simplifier  to  replace  an 
expression  A  by  a  larger  expression  which  has  A  as  a 
subexpression,  this  definition  may  become  circular.  We  assume 
that  no  such  errors  have  been  committed. 

II.  If  £  is  a  sum,  2a  ,  then  all  fixed  a  are  subtracted  from 

i  j 

£,  and  then  the  rest  of  the  a  are  examined  as  follows: 

i 

A*  If  a  is  a  product  with  more  than  one  unmatched  variable, 
i 

it  is  ambiguous.  Any  of  the  variables  might  match  the  whole 
expression.  Processing  such  a  pattern  will  cause  a  warning  to 
be  printed,  and  the  pattern  will  be  treated  as  in  E  below,  as 
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an  occurrence  of  the  specific  function  "MTIMES"  with  a  fixed 
number  of  arguments. 

9.  If  a(  Isa  product  of  a  declared  variable  *  and  a  fixed 

pattern  £  then  y's  predicate  Is  applied  to  the  coefficient  of 
1  In  £.  (The  definition  of  "coefficient"  used  here  may  be 
found  In  Chapter  3,  In  the  description  of  the  RATCOEF 
command, )  If  it  fails,  the  match  falls,  otherwise  It  proceeds. 
(That  Is,  y  Is  compared  to  the  coefficient  of  £  in  £.) 

C.  If  a(  is  an  unmatched  variable,  then  It  should  be  the  only 

unmatched  S|,  since  it  will  match  the  rest  of  the  expression. 

If  selectors  am  used,  there  might  be  more  than  one  remaining 
a|'  'n  Wh'ch  case  they  ml*h'  correctly  separate  out  the  rest 

of  the  expression  into  several  parts.  A  warning  Is  printed  In 
this  situation. 

0.  If  a(  is  an  exponentiation,  one  of  three  possibilities 

exists.  Either  the  base  is  fixed,  the  exponent  is  fixed,  or 

neither  Is  fixed.  (If  both  were  fixed,  a  would  be  fixed,  and 
thus  be  treated  under  i.)  1 

1.  The  base  is  fixed:  A  search  Is  made  for  an  exponential 

operator  with  the  given  base.  If  the  search  succeeds,  the 

pattern  for  the  exponent  Is  compared  to  a  's  exponent. 

i 

Here,  as  elsewhere.  If  the  comparisons  of  subexpressions 
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faM/  the  match  falls.  If  the  search  falls/  the  base  may 
occur  to  the  first  power.  If  the  base  Is  found  In  £,  then 
the  pattern  for  the  exponent  Is  compared  to  the  number  1. 
If  the  base  Is  a  sum  Itself,  It  Is  subtracted  from  £,  and 
the  pattern  for  the  exponent  compared  to  1. 

2.  The  exponent  Is  fixed:  A  search  is  made  for  an 
exponential  operator  with  the  given  exponent.  If  it 
succeeds/  the  pattern  for  the  base  Is  compared  to  a  ' s 
base.  If  the  search  fails  and  the  exponent  Is  a  negative 
Integer,  1  is  subtracted  from  £  and  the  pattern  for  the 
base  is  compared  with  1  (the  case  of  a  missing 
denominator).  Otherwise,  (the  exponent  is  not  a  negative 
integer)  the  pattern  for  the  base  Is  compared  with  0.  This 
means  that  the  pattern  a+l/b  (with  a  and  b  declared  TRUE) 
will  match  the  expression  X+l  with  a=X,  b-1,  and  will  match 
the  expression  X  with  a=X-l,  b~l.  The  pattern  a+b**2  will 
match  the  expression  X  with  b»0,  a*X. 

3.  Neither  is  fixed:  Any  exponentiation  is  searched  for. 
Exponentiation  is  treated  as  a  two-argument  function  with 
name  "MEXPT"  as  in  E  below. 

4.  If  an  exponentiation  being  searched  for  in  a  sum  is 
actually  the  only  item  left  in  the  sum  (e.g.  y**x  +  A  after 
A  has  been  matched  and  removed)  then  other  special  cases 
are  considered,  if  the  base  B  is  fixed,  then  B**E  matches  1 
I  f  B  0  and  E  matches  0.  If  the  exponent  E  is  fixed,  then 
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B  matches  0  If  E  Is  a  number  greater  than  0  and  B 
matches  0. 

E.  If  4|  Is  a  specific  function  (e.g.  SIN)  then  the  first 

occurrence  of  that  function  Is  searched  for.  The  arguments  of 
the  pattern  are  compared  with  the  corresponding  arguments  In 
the  expression,  and  a  check  is  made  to  assure  that  the  same 
number  of  arguments  appears  In  the  pattern  and  In  the  expres¬ 
sion.  If  all  the  component  matches  succeed,  a  ,  the  pattern, 
(now  fixed)  Is  subtracted  from  £.  1 

F.  If  a^  is  a  function  whose  name  Is  an  unmatched  variable, 

then  an*  function,  (possibly  .,  .,  or  ls  searched  for<  ^ 
treated  as  In  E. 

III.  If  n  is  a  product,  TTa^  then  the  sum  operations  (except  for 

ll-A  and  1 1 -B)  are  duplicated,  with  "divide"  replacing  "subtract" 

and  "product"  replacing  "sum."  since  products  within  products  are 

not  possible  with  the  MACSYMA  simplifier,  the  action  taken  in  II- 

A  or  ll-B  has  a  correlate  In  III  only  if  the  simp.  er  Is  turned 

off;  in  such  situations,  semantic  pattern  matches  will  not 
succeed  anyway. 

IV.  If  J)  Is  an  exponentiation,  then  a  is  treated  as  In  ll-D,  1, 

2,  and  A.  If  neither  the  base  nor  the  exponent  is  fixed,  (the 
situation  of  1 1 -0-3),  £  is  treated  as  follows: 

A*  1 f  £  is  1,  £  is  compared  to  1**0. 

B.  If  £  is  0,  £  is  compared  to  0**1. 
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c.  If  £  Is  not  an  exponentiation,  a  is  compared  to  f.J, 

0.  If  £  is  an  exponentiation,  the  respective  bases  and 
exponents  of  jj  and  £  are  compared. 

v.  If  fi  is  some  specific  function.  It  Is  treated  as  follows:  The 
function  name  in  fi  (e.g.  SIN)  must  match  the  leading  operator  In 
£.  The  respective  arguments  of  the  pattern  and  expression  are 
then  compared  and  a  check  is  made  that  the  same  number  of 
arguments  appears  In  the  pattern  and  in  the  expression,  if  all 
the  component  matches  succeed,  the  pattern  succeeds, 
vi.  If  fl  is  an  unspecified  function  whose  name  Is  unmatched.  It 
is  treated  as  In  V,  except  that  the  unmatched  function  name  of 
IS  compared  to  the  leading  operator  of  £. 

VII.  If  B  Is  an  atomic  unmatched  variable.  It  Is  compared  to  £. 

These  operations  may  be  nested  to  an  arbitrary  depth,  since 
comparing  a  pattern  and  an  expression  may  Invoke  comparisons  of 
subexpressions.  Furthermore,  this  algorithm  Is  exhaustive.  In  the 
sense  that  given  any  syntactically  valid  MACSTMA  expression,  a 
pattern  matching  process  will  be  defined  for  It. 

Appendix  il 

The  following  LISP  listing  o,  QUAD  uses  several  system  conven¬ 
tions  which  can  be  briefly  summarized  as  follows: 

All  user  variable-names  have  a  dollar  sign  prefixed  to 
them.  The  *KAR( ERRSETt . . . ) )  construction  serves  only  to  catch 
illegal  operations  or  ERRO's  and  return  MIL  in  such  instances. 
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MATCOEF(X/Y)  returns  the  coefficient  of  Y  in  X  as  found  by  the 
RATCOEF  of  chapter  3.  MEVAL(X)  is  the  MACSYMA  evaluator.  It 
substitutes  values  for  variables  in  the  expression  X,  evaluates 
the  result,  and  returns  a  simplified  expression  as  an  answer. 
RATSIMP(X)  rationally  simplifies  X.  RETLIST  returns  a  list  of  its 
arguments  and  their  values. 

The  GOOn  names  are  symbols  produced  to  meet  the  need  for 

unique  new  variable  names. 

(DEFUN  SQUAD 
(G0042  $X) 

( *KAR 

(ERRSET  (PROG  (G0043  G0044) 

(SETQ  G0043 

(MATCOcF  G0042 

(MEVAL  (QUOTE  ( (MEXPT  SIMP) 

$X 

2))))) 

(COND  ((MEVAL  (QUOTE  ( ( $NONZEROANDFREEOF) 

$X 

G0043) ) ) 

(SETQ  $A  G0043 ) ) 

((ERR))) 

(SETQ  G0042  (MEVAL  (QUOTE  (($RATSIMP) 

( ( MPLUS ) 

G0042 
( (MTIMES) 

-1 

G0043 

((MEXPT  SIMP) 

$X 

2))))))) 

(SETQ  G0044  (MATCOEF  G0042  $X)) 

(COND  ( ( SFREEOF  $X  G0044)  (SETQ  $B  G0044)) 
((ERR))) 

(SETQ  G0042  (MEVAL  (QUOTE  ((IRATSIMP) 

((MPLUS) 

G0042 

((MTIMES) 

-1 

G0044 

$X)))))) 

(COND  ( ($FREE0F  $X  G0042)  (SETQ  $C  G0042)) 
((ERR))) 

(RETURN  (RETLIST  $C  $B  $A  $X)))))) 
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Append i x  III 

This  appendix  considers  the  question  of  pattern  matching 
from  a  more  theoretical  standpoint,  it  answers  some  questions 
about  the  formal  power  of  pattern  matching  in  determining 
membership  of  an  expression  In  a  class/  and  the  ability  of  the 
pattern  match  to  uniquely  determine  the  values  of  the  variables 
in  the  pattern.  Many  of  these  results  may  seem  trivial  or 
obvious;  nevertheless,  they  are  not  expressed  elsewhere.  Some 
especially  trivial  results  are  (1)  Any  expressions  E,  synthesized 
by  MACSYMA  can  be  matched  by  a  pattern,  namely,  the  pattern  E. 
and  (2)  Any  expression  in  MACSYMA  can  be  completely  decomposed  by 
some  pattern:  By  using  explicit  matches  for  operators  (as  in 
section  4,  line  C6),  every  single  component  of  any  expression  can 
be  given  a  name.  (Since  we  usually  seek  to  define  general 
patterns,  such  explicit  matches  are  rarely  of  great  use.) 

D.e.f LaUJLaO »  a  pattern  match  program  (PMP)  is  a  program  produced 
by  the  implementation  of  the  algorithm  described  in  Appendix 
I,  given  a  valid  MACSYMA  expression. 

Theorem  2.111.1.  A  PMP  for  a  finite  pattern  is  finite  in  speci¬ 
fication. 

Proof..  A  finite  pattern,  written  as  a  tree,  has  a  finite  number 
of  nodes.  The  algorithm  of  Appendix  i  traverses  the  tree 
once,  emitting  a  finite  number  of  finite  steps  at  each  node 
(in  practice,  fewer  than  3  LISP  MS-express i ons"  per  node). 

The  algorithm  terminates  when  the  tree  has  been  traversed.  (A 
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more  rigorous  proof  is  possible,  but  would  require  a  detailed 
analysis  of  the  correctness  of  our  programming  implemen¬ 
tation.  This  could  be  done  by  case  analysts,  as  described  in 
(28).) 

We  will  assume,  for  the  remainder  of  this  appendix  that 

all  PMPs  are  defined  for  finite  patterns,  and  are  therefore 
finite  in  size. 


■ThMrem  2.m.2.  A  PMP  always  terminates  given  finite  expressions 
for  each  of  Its  argumentr  If  its  predicates  always  terminate,  and 
the  evaluator  (MEVAL)  always  terminates  (with  a  finite  result)  on 
finite  exoress Ions. 


££flaf.  A  PMT  Is  a  finite  non-looping  sequence  of 
terminates,  since  It  is  either  an  application  of 


steps.  Each  step 
a  predicate,  an 


evaluation  of  an  expression,  or  the  extraction  of  the  ith 


argument  of  an  n-ary  function.  The  last  of  these  clearly 
terminates  since  i  and  n  are  finite  by  hypothesis,  and  the  first 


two  terminate  by  hypothesis. 


Since  patterns  are  themselves  predicates.  It  is  possible 
(and  often  useful)  to  use  them  recursively,  in  such  cases 
termination  will  be  difficult  to  guarantee  by  this  theorem. 
Although  this  theorem  states  sufficient  conditions  for  termina¬ 
tion,  these  conditions  are  not  necessary,  since,  for  example, 
non-terminating  predicates  may  be  present  in  a  pattern,  but  may 
never  be  applied  if  the  pattern  fails  first. 
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For  the  remainder  of  this  appendix,  all  PMP's  are  assumed 
to  be  terminating.  Thus  a  PMP  divides  the  set  of  finite  algebraic 
expressions  into  two  classes,  P,  the  expressions  which  satisfy 
the  pattern,  and  N,  those  which  do  not. 

Now,  a  result  which  determines  a  theoretical  limit  (but  not 
necessarily  the  best  limit)  on  the  power  of  pattern  matching: 

Jlt&Qr.fim  2.1  I  1.3.  Unless  N  or  P  is  empty,  there  is  a  MACSYMA  ex¬ 
pression  P  in  P  which  is  functionally  equivalent  to  a  MACSYMA 
expression  n  in  N. 

iiLGflf.  Let  us  assume  for  the  moment  that  the  MACSYMA  simplifier 
Is  unaware  of  special  angle  simplifications,  and  let  a  PMP 
program  pass  only  expressions  which  match  zero  (0).  Neither  P 
nor  N  Is  empty,  so  this  theorem  asserts  there  is  an  element 
In  N  equivalent  to  zero.  We  can  show  there  are  many.  One  of 
them  Is  COS (PI / 2 ) .  To  see  if  COSCPI/2)  matches  0,  (see  the 
first  line  of  the  algorithm.  Appendix  I)  we  simplify 
COS (PI/2)— 0.  The  result,  COS ( PI /2) ,  is  not  identically  the 
expression  "0"  and  therefore  Is  in  N. 

One  might  blame  the  MACSYMA  simplifier  for  this 
inadequacy,  except  for  the  following  lemma,  which  proves  that 
the  simplifier  cannot  be  made  sufficiently  adequate. 
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Lemma.  (Richardson,  (36),  see  (6)  also)  Let  R  be  the  set  of 
expressions  generated  by 

(i)  the  rational  numbers,  and  the  real  numbers  pi  and 
log  2, 

( i i )  the  variable  x, 

(iii)  the  sine,  exponential  and  absolute  value  func¬ 
tions. 

Then  if  E  is  an  expression  in  R,  the  predicate  "E  is 
equivalent  to  0"  is  recursively  unsolvable. 

Since  all  of  these  operations  and  constants  are  permissible 
in  MACSYMA,  there  is  no  computation  which  can  exclude  0- 
valued  functions  from  N.  Furthermore,  since  "0"  can  be  added 

to  any  pattern  whatsoever,  the  same  analysis  holds  for  any 
PMP . 

This  concludes  the  proof  of  theorem  2.1  I  1.3. 

Our  only  hope  is  that  some  sub-domains  within  R  have  less 
disastrous  consequences  for  pattern  matching.  Such  is  the  case. 
Let  us  use  the  convention  that  A, , . . . , Am  are  MACSYMA  variables 
which  have  been  declared  TRUE,  and  X,,...,Xn  are  constants.  (This 
convention  may  seem  odd,  but  consider  that  AX+B  as  a  pattern  has 
variables  A  and  B  and  constant  X.) 

Since  we  are  restricting  the  domain  of  expressions  handed 
to  PMP's,  we  will  be  affected  by  the  power  of  the  simplifier. 

Thus  while  E  *  X  +  C0S(PI/2)*SIN(X)  does  not  look  like  a 
polynomial  in  X,  a  suitable  simplifier  will  transform  E  into  the 
equivalent  expression  X,  which  Is  a  polynomial  in  X.  All  expres¬ 
sions  mapped  into  a  domain  D  by  a  simplifier  s,  are  members  of 
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the  class  Ds.  No  rigorous  definition  of  Ds's  will  be  attempted, 
since  the  simplifier  in  MACSYMA  defies  simple  analyses,  and  in 
any  case,  it  can  be  altered  by  the  user. 

I.  hear  am  2.MI.4.  Let  D  be  the  domain  of  polynomials  in  any  finite 
number  of  var  i abl es  (X;J  with  integer  or  symbolic  coef¬ 
ficients.  A  pattern  consisting  of  any  (expanded)  member  of  D 
with  variable  coefficients  {A;}  will  match  uniquely  any 
member  of  Ds. 

£££&£•  An  expanded  member  of  D  will  look  like 

A  PMP  for  this  pattern  win  consist  of  a  finite  set  of  calls 
to  the  coefficient-finding  routine,  which  will  assign  to  each 
A,  the  coefficient  of  X,  '  * . . .  ♦X^'  .  These  coefficients 

can  be  extracted  because  this  representat ion  of  polynomials 
is  canonical,  and  the  coefficients  are  obviously  unique  in 
any  expression  (regardless  of  its  original  form)  which  can  be 
transformed  into  an  equivalent  polynomial  in  X1,...,Xn. 

JAe.ar.em  2.1  11.5.  Let  D  be  the  domain  of  rational  expressions 
(i.e.,  ratios  of  polynomials  as  in  theorem  4).  A  pattern 
consisting  of  the  ratio  of  two  expanded  polynomials  with 
variable  coefficients  will  match  uniquely  any  member  of  Ds. 
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£L2fi.f..  The  numerator  is  matched  as  in  theorem  4,  and  the 

denominator,  which  appears  as  a  polynomial  raised  to  the  -1 
power,  is  matched  as  in  theorem  4.  The  absence  of  a 
denominator  will  cause  the  pattern  for  the  denominator  to  be 
matched  against  1. 

These  results  can  be  extended  ir.  various  directions,  but 
results  become  more  specialized  and  less  illuminating.  For 
example, 

T.h&arer-I  2.III.6.  Let  f  be  an  n-ary  function  with  no  simplifica¬ 
tion  rules  in  the  simplifier  s.  Let  Ds  be  the  Ds  of  the 
previous  theorem,  and  fd^  elements  of  Ds.  Let  Df  be  the  set 
of  expressions 

d,*f(d1,...,dn)+dn4l  . 

Then  an  expression  in  Df  with  variable  coefficients  will 
match  uniquely  any  member  of  Dfs, 

£naaf.  The  single  occurrence  of  f  can  he  found,  and  its  "coef¬ 
ficient"  d„  and  "constant  term"  d„,t  can  be  matched  as  in 
theorem  5. 

Typical  of  statements  which  are  true,  but  are  of  only 
limited  interest  is:  Let  Ds  be  the  Dfs  of  the  previous  theorem, 
and  let{dj)  be  in  Ds.  Let  E  be  the  set  of  expressions  df'  .  The 
pattern  matches  uniquely  any  member  of  Es,  but  the  uniqueness  is 
imposed  by  the  match  algorithm.  Thus  1  is  a  member  of  Es,  as 
1**0,  but  the  possibilities  1**1  or  x**Q  are  not  considered. 
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When  we  restrict  ourselves  to  matching  expressions  composed 
over  classes  for  which  canonical  forms  exist,  as  in  theorem  4, 
quite  neat  results  can  be  obtained  if  the  simplifier  is  able  to 
compute  these  canonical  forms.  For  many  areas  of  interest, 
canonical  form  algorithms  do  not  exist,  yet  being  able  to 
recognize  menibers  of  particular  classes  within  the  confines  of  a 
simplifier  can  still  be  useful.  For  example,  recognizing  the 
parameters  of  a  differential  equation,  even  if  it  can  be  done 
only  in  some  standard  form,  is  useful,  even  if  some  other,  rarely 
encountered,  but  equivalent  form  is  not  recognized  at  all. 

In  comparing  various  systems  in  this  context,  the  principal 
point  is  that  for  the  same  domain  D,  the  ability  of  the 
simplifier,  s,  to  reduce  an  expression  to  essential  components, 
strongly  influences  the  size  of  the  set  Ds.  MACSYMA's  simplifiers 
(the  "ordinary"  one  which  can  be  modified  by  the  user,  the 
RATSIMP  rational  simplifier,  and  the  RADCAN  radical  canonical 
simplifier  of  Chapters  3  and  5),  provide  a  range  of  possibilities 
larger  than  that  of  any  other  existing  system.  While  any  system 
which  is  in  theory  equivalent  to  a  Turing  machine  could  in  theory 
do  as  well  (eventually)  as  MACSYMA,  or  even  better,  theorem  3 
provides  a  bound  on  their  theoretical  capabilities  just  the  same. 
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Chapter  3  -  Extending  the  Power  of  the  Rational 
Funct i on  Fac i 1 i t ies 

This  chapter  concerns  the  practical  implications  of  what  I 
believe  to  be  the  most  significant  design  decision  in  MACSYMA. 
MACSYMA  was  designed  with  the  intention  of  nsil  necessarily 
restricting  its  components  to  the  same  data  representation.  The 
rational  function  package  embodies  the  essentials  of  a  special 
data  type  which,  by  suitable  treatment,  has  yielded  a  number  of 
new  results.  These  results  include  particularly  powerful  tech¬ 
niques  for  extracting  coefficients  (section  3.3),  for  substi¬ 
tution  (section  3.5),  and  for  solving  for  a  variable  in  an 
expression  (section  3.6). 

Since  a  number  of  other  current  systems  (e.g.  REDUCE  (19)) 
also  include  analogous  special  rational  function  representations, 
the  new  ideas  and  techniques  discussed  here  could,  no  doubt,  be 
implemented  elsewhere  with  relative  ease. 


By  using  the  rational  function  representation  (as  opposed 
to  the  general  representation),  extremely  fast  processing  is 
possible.  For  typical  calculations  which  can  be  done  either  way, 
the  rational  function  representation  can  easily  reduce  the  time 
requirements  by  a  factor  of  five  or  ten,  becoming  far  more 
efficient  as  the  problem  increases  in  size.  This  in  itself  can  be 
a  significant  asset.  In  order  to  make  this  point  more  concrete, 
and  to  demonstrate  how  MACSYMA  compares  to  similar  efforts  else¬ 
where,  some  timing  information  has  been  compiled.  Only  the 
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crudest  efforts  at  making  the  cross-system  comparisons  truly 

comparable  have  been  att.mpted;  no  doubt  an  extensive  study  could 

be  conducted  in  balancing  the  differences  of  word-length,  CPU 

cycle  time,  memory  access  time,  size  of  storage,  CPU  instruction 

set,  etc.  The  timings  in  table  3.1  are  for  the  calculation  of 

the  first  10  polynomials  in  the  "f  and  g"  series,  the  details  of 

which  may  be  found  in  (11)  or  (31).  The  calculation  is  of  two 

sets  of  polynomials  in  sigma,  mu  and  epsilon,  defined  recursively 

in  terms  of  each  other  and  derivatives  of  lower  order  terms.  The 

calculation  can  be  indicated  in  MACSYMA's  rational  function 

representation  through  the  following  input: 

XI:  RAT ( -S I GMA* (MU  ♦  2*EPS))$ 

X2:  RAT ( EPS-2*S I GMA* *2) $ 

X3:  RAT ( -3*MU*S i GMA) $ 

F[0J  ;  RAT  ( 1)  $ 

G  [0]  :  RAT (  0)  $ 

F  H 1  :■  -MU*G  [  i -1J  ♦  X1*DI FF(F f I -11 , EPS) 

♦  X2*DiFF(F[i-l]/SiGMA) 

♦  X3*DiFF(F[i-l],MU)$ 

G  [  I  ]  F[i-1]  ♦  Xl*DiFF(G|  i  - 1 1 ,  EPS ) 

♦  X2*DIFF(G[i-l],Si GMA) 

♦  X3*DiFF(G[l-i],MU)$ 

r  CIO] y 

G  [10]  (d 

Timings  for  systems  other  than  MACS YMA  are  interpolated  from  (11) 
or  from  conversations  with  the  authors  of  various  systems 
presented  at  SYMSAM/2  (1). 
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Table  3.1  -  A  Cross-System  Timing  Comparison 


System 

Computer 

T  i  me 

Adjusted  Notes 

( sec) 

Time* 

Mach i ne- Language 

Sys  terns 

PM 

IBM  7091* 

4.4 

18 

FORMAC 

IBM  7094 

39.0 

156 

SAC-1 

CDC  1604 

25.8 

26 

SYMBAL 

CDC  1604 

52.2 

53 

SYMBAL 

CDC  6600 

4.3 

65 

CAMAL 

ATLAS-2 

2.0 

4 

(1) 

Systems 

Written  in 

H  i  gher  ■ 

■Level  Languages 

using  bpecial  Rational 

1  Functi 

'on  Representations 

REDUCE  2 

IBM/360-67 

4.5 

45 

(2) 

10. 

100 

(3) 

PDP-10 

10.5 

55 

(4) 

1  AM 

PDP-10 

152 

760 

(5) 

MACSYMA's  rational 

function  routines 

PDP-10 

14.8 

30 

(6) 

Systems 

Wr i t  ten  i n 

H i gher 

Level  Languages 

us  i  ng 

General  (Tree)  Rep 

resentat  t  ons 

Korsvold's  System 

IBM  7094 

119.0 

475 

MACSYMA's  general 

representation 

PDP-10 

76. 

152 

(7) 

85 


Notes  for  Table  3.1 

*For  a  direct  comparison/  we  have  used  the  following  somewhat 
controversial  speed  factors:  CDC  1604  =  1,  PDP-10  (with  2.8 
microsecond  memory)  =  2,  ATLAS-2  =  2,  IBM  7094  =  4/  PDP-10  (with 
1  microsecond  memory)  =  5/  IBM  360/67  ■  10#  CDC  6600  «  15.  In 
addition/  the  notes  must  be  taken  into  account  in  computing  the 
adjusted  time.  REDUCE2/  Korsvold's  System/  and  MACSYMA  are  all 
written  in  LISP/  and  are  subject  to  variations  depending  on  the 
efficiency  of  the  underlying  language  implementation. 

(1)  A  particularly  rough  interpolation;  the  actual  time  was  7.4 
seconds  for  F  and  G  to  index  19.  CAMAL  uses  a  representation 
which  packs  a  great  deal  of  information  in  a  single  node;  it  thus 
uses  less  space/  and  less  pointer-fol lowing  time  than  the  other 
systems  listed  here. 

(2)  Using  Stanford  University  LISP. 

(3)  Using  IBM's  Scratchpad  LISP  which  is  slower  than  Stanford 
LISP/  since  it  packs  two  addresses  in  a  32-bit  word/  thus 
requiring  shifts  to  adjust  the  addresses. 

(4)  Using  1  microsecond  memory. 

(5)  Using  1  microsecond  memory.  1AM  is  written  in  AMBIT/L/  and 
is  interpreted/  rather  than  compiled. 

(6) /  (7)  Using  2.8  microsecond  memory. 

These  times  (and#  no  doubt/  other  LISP  times)  can  be 
decreased  by  some  40  percent  by  methods  unrelated  to  the 
algorithms:  By  using  a  larger  core  allocation/  LISP  garbage 
collection  time  can  be  reduced;  also#  a  cleverer  LISP  arithmetic 
statement  compiler  (now  being  implemented)  would  reduce 
calculation  time  further. 
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3.0  An  Introduction  and  a  "Political"  Digression 
Mo  es,  in  (34),  describes  a  spectrum  of  attitudes  towards 
algebraic  manipulation  ranging  from  the  "radical"  to  the 
"conservative."  According  to  this  classification,  a  radical 
system  will  transform  a  user-supplied  expression  into  an  internal 
format  which  consists  of  an  encoding  of  the  expression  in  a 
special  unique  simplified  form.  This  drastic  transformation 
generally  destroys  superficial  resemblances  between  the  input  and 
output.  The  only  attribute  necessarily  preserved  is  the 
functional  value  of  the  expression.  Polynomial  and  rational  func¬ 
tion  systems  generally  fall  in  the  "radical"  category.  The 
contrasting  "conservative"  approach  does  almost  nothing  but  that 
which  is  specified  by  the  user;  it  keeps  the  internal  form  as 
nearly  the  same  as  the  external  form  as  is  possible,  and 
generally  accepts  a  wide  variety  of  expressions  (wider  than  poly¬ 
nomials  and  rational  functions). 

The  top-level  ("liberal"  in  Moses*  terminology)  "general" 
simplifier  and  evaluator  in  MACSYMA  takes  a  stance  in  the  middle. 
!t  has  some  built-in  rules  (e.g.  concerning  zero  and  one, 
collecting  terms)  and  by  ordering  terms  in  sums  and  products, 
does  a  fair  job  of  simplifying  a  large  class  of  expressions.  Its 
importance  1 i es  in  the  fact  that  it  allows  certain  subsystems  to 
explore  the  far  reaches  of  the  "political"  spectrum.  Because  of 
the  conjunction  of  different  approaches,  radical  simplification 
algorithms  can  be  applied  to  expressions  which  would  not 
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ordinarily  be  considered  proper  inputs.  For  example,  the  ability 
2X  X 

to  manipulate  e  ♦  2e  ♦  1  as  a  quadratic  in  e  (and  apply  poly¬ 
nomial  'radical  processing)  is  quite  useful,  even  though  the  ex¬ 
pression  is  not  quite  fair  game  for  ordinary  polynomial  systems. 
MACS i MA  is  capable  of  factoring  the  above  expression  into 
X 

(e  +1)  ,  and  treating  it  as  a  polynomial  for  various  purposes; 
however,  it  is  also  capable  of  noticing  that  eX  can  reduce  to  y 
when  x= log( y) .  Polynomial  or  rational  function  systems  are  rarely 
aware  of  such  possibilities  in  their  data. 

This  chapter  discusses  the  "radical"  data  handling 
facilities  of  MACSYMA,  and  their  relation  to  the  MACSYMA  command 
level.  In  one  particular  instance  (the  SOLVE  command)  we  show  how 
radical  and  conservative  handling  of  different  parts  of  the  same 
expression  can  lead  to  an  end  result  which  could  be  produced  with 
either  approach  alone  only  with  great  difficulty.  Other  commands 
where  rational  simplification  or  other  radical  approaches  are 
essential  to  programming  effective  algorithms  are  also  discussed. 

By  an  unfortunate  coincidence  in  terminology,  we  will  use 
the  word  "radical"  in  two  senses.  In  one  case,  we  will  discuss  a 
class  consisting  of  algebraic  extensions  adjoined  to  the  field  of 
rational  functions.  This  class  is  generally  called  the  class  of 
radical  expressions  (in  the  sense  that  a  square  root  is  a 
radical).  In  the  second  case,  our  approach  to  simplifying 
radical  expressions  is,  in  Moses'  terminology,  radical  (i.e. 
drastic). 
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In  this  and  later  chapters,  the  algorithm  and  the  command 
used  for  invoking  the  algorithm  used  to  slmpl ify  radical  expres- 
iiani  will  be  referred  to  as  RADCAN.  RADCAN  and  the  two  commands 
to  be  described  in  section  3.1,  RATSIMP  and  FULLRATSIMP  are  all 
classified  as  radical  (i.e.  drastic)  simplifiers. 

3.1  Basic  Rational  Function  Commands 
In  order  to  clarify  the  discussion,  it  is  necessary  to  dis¬ 
tinguish  between  the  two  major  internal  forms  for  expressions  in 
MACSYMA.  Ordinary  MACSYMA  form  is  a  delimiter  prefix  form  which 
is  typical  of  many  list-processing  Implementations  of  algebraic 
manipulation  systems.  For  example,  3x2  would  be  represented 
(glossing  over  inessential  details)  as  (times  3  (expt  x  2)),  and 
x  +  y  as  (plus  x  y).  By  contrast,  the  .canonical  rational  expression 
(CRE)  form  In  MACSYMA  is  an  internal  form  especially  suitable  for 
rapid  manipulation  of  sparse  polynomials  and  rational  functions. 

2 

In  CRE  form,  3x  Is  represented,  (again,  glossing  over  details) 
as  (x  2  3).  The  first  element  of  the  list  is  the  variable,  the 
second  is  its  highest  exponent,  and  the  third,  the  coefficient  of 

2 

the  just  preceeding  exponent.  Thus  6x  *4  is  represented  as  (x  2  6 

0  4),  and,  allowing  coefficients  themselves  to  be  polynomials, 

2 

x  Y+7xz  is  (x  2  (y  1  1)  1  (2  i  7)).  Since  (y  1  (x  2  1)  0  (x  1  (Z 
1  7)))  is  an  equivalent  CRE  representat i on,  it  should  be  clear 
that  the  ordering  of  variables  must  be  specified  to  insure  that 
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equivalent  CRE's  are  Identical,  that  Is,  they  are  In  cannnlnal 
form. 

CRE's  in  general  represent  rational  expressions,  that  is, 
ratios  of  polynomials,  where  the  numerator  and  denominator  have 
no  common  factors,  and  the  denominator  is  positive.  Thus  a  CRE 
has  three  essential  parts:  a  variable  list  (VARLiST),  specifying 
the  ordering  of  the  variables,  and  two  polynomial  parts. 

With  these  preliminaries,  we  can  describe  the  actions  of 
the  rational  function  commands. 

RATVARS(a,b, . . . )  orders  the  variables  listed  in  its  argu¬ 
ment  list  on  a  global  variable  list  (VARLIST)  so  that  the 
rightmost  element  of  the  list  a,b,...  will  be  the  main  variable 
of  future  rational  expressions  in  which  it  occurs,  and  the  other 
variables  will  follow  in  sequence,  if  a  variable  is  missing  from 
the  RATVARS  list,  it  will  be  given  lower  priority  than  the 
leftmost  element.  If  several  variables  are  missing,  they  will  be 
ordered  by  the  MACSYMA  function  GREAT,  which  uses  an  implementa¬ 
tion  of  the  ordering  algorithm  described  in  (34).  The  arguments 

to  RATVARS  can  be  either  variables  or  non-rational  functions 
(e.g.  S I N( X ) ) . 

RATS  I MP( EXP)  rationally  simplifies  the  expression  EXP. 

That  is,  EXP  is  converted  into  a  single  fraction,  whose  numerator 
and  denominator  are  polynomials  over  the  integers,  with  no  common 
factors.  EXP  is  written  in  a  recursive  form:  a  polynomial  in  the 
main  variable  whose  coefficients  are  polynomials  In  the  next- 
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higher-order  variable/  .../  whose  coefficients  are  integers.  This 
is  accomplished  by  converting  EXP  into  CRE/  and  then  converting 
back  to  ordinary  MACSYMA  form  for  display. 

For  example: 


CC1)  (X**2-Y**2)*(Z**2+2*Z)/((X+Y)*W)@ 

2  2  2 

con 

W  (Y  ♦  X) 

( C2)  RATS  I MP( D1 ) @ 

2 

(X  -  Y)  Z  +  (2  X  -  2  Y)  Z 
(D2)  .  . 

W 

C C 3 )  RATVARS (X) $ 

CC4)  RATS  I M  PC  D 1 ) @ 

2  2 
x  (Z  ♦  2  Z)  -  Y  Z  -2YZ 
(D4)  . 

W 


FACTORC EXP)  factors  the  expression  EXP  into  factors 
irreducible  over  the  Integers.  If  EXP  is  a  rational  expression 
(with  a  denominator  not  1)  both  numerator  and  denominator  are 
factored.  If  FACTORFLAG  is  set  to  TRUE/  the  integer  multiplier/ 
if  any#  Is  factored  also.  The  algorithm  can  be  used  to  factor 
polynomial  in  any  number  of  variables.  GFACTOR(EXP)  factors 
polynomials  over  the  Gaussian  integers. 

For  example/ 

CCS)  FACTORC X**6+  D @ 

2  4  2 

<D5)  (X  +  1)  (X  -  X  +1) 
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SQFR(EXP)  is  similar  to  FACTOR  except  that  the  polynomial 
factors  are  "square-f ree"  that  is,  have  no  multiple  roots.  This 
algorithm,  which  is  also  used  by  the  first  stage  of  FACTOR, 
utilizes  the  fact  that  a  polynomial  has  in  common  with  its  nth 
derivative  all  its  factors  of  degree  >  n.  Thus  by  taking 
derivatives  with  respect  to  each  variable  in  the  polynomial,  all 
factors  of  degree  >  1  can  be  found.  Several  special  cases  are 
also  factored,  including  the  removal  of  polynomial  contents. 

PARTFRAC( EXP, VAR)  expands  the  expression  EXP  in  partial 
fractions  with  respect  to  the  main  variable,  VAR.  The  algorithm 
employed  is  based  on  the  fact  that  the  denominators  of  the 
partial  fraction  expansion  (the  factors  of  the  original 
denominator)  are  relatively  prime.  The  numerators  can  be  written 
as  linear  combinations  of  denominators,  and  the  expansion  falls 
out. 

(C6)  PARTFRAC( X/ (X**2-1),X)@ 


1  1 

(06)  . ♦ - 


2  X  -  2  2  X  ♦  2 

3.2.  Contagious  CRE  Commands 

The  above  commands  represent  no  new  capabilities;  MATHLAB 
(29)  has  almost  identical  facilities,  although  its  internal 
equivalent  of  our  CRE's  is  less  efficient  for  sparse  polynomials. 
Other  systems,  by  limiting  their  universe  of  discourse  to 
canonical  representations,  make  the  explicit  RATSiMP  commands 
unecessary.  Nevertheless,  an  algorithm  equivalent  to  RATSIMP 
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must  be  present  In  order  to  maintain  the  canonical  represen- 
tations  during  a  computation. 

The  commands  In  this  and  the  following  sections  represent 

significant  departures  from  the  usual  use  of  rational  function 
routines. 


RAT(EXP)  Is  Indistinguishable  on  command  level  from 
RATSIMP;  however,  RAT  Its  Internal  result  In  rational 

function  (CRE)  form,  so  that  operations  used  by  the  rational 
function  commands  described  here  can  be  more  rapidly  performed  o 
it.  Furthermore,  any  time  the  user  adds  to  or  multiplies  by  a 
CRE,  the  result  Is  a  CRE.  That  Is,  the  CRE  form  Is  "contagious." 
This  enables  a  user  to  easily  force  his  entire  calculation  to  be 
done  In  CRE  form  by  converting  one  of  his  inputs  Into  CRE  by 
simply  multiplying  by  RAT<1).  Some  problems  require  excessive 
amounts  of  storage  and/or  time  If  Intermediate  results  are 
converted  back  Into  prefix  form  at  each  step  of  the  calculation. 
The  RAT  facility,  by  being  integrated  Into  the  simplifier, 
permits  a  user  to  compose  a  program  and  try  It  out  (without  any 
changes)  on  ordinary  prefix  form  arguments  oj;  on  CRE  arguments. 

In  this  manner  It  is  simple  to  compare  the  timing  of  "general" 
versus  CRE  methods  on  the  same  task.  This  very  often  demonstrates 
that  CRE  methods,  when  appropriate,  are  much  faster. 


level 


RATD I SREP( EXP) ,  which  appears  to  do 
changes  Its  argument  from  rational 


nothing  on  the  command 
function  form  (CRE)  to 
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ordinary  MACSYMA  form.  This  is  sometimes  necessary  in  order  to 
use  some  of  the  other  MACSYMA  commands.  If  RATDISREP  is  not 
given  a  CRE  for  an  argument,  it  does  nothing. 

3.3.  The  Rational  Coefficent  Program 
RATCOEFCEXP, PART)  returns  the  coefficient,  C.  of  the  ex¬ 
pression  PART  In  the  expression  EXP.  c  will  be  free  (except 
possibly  In  a  non-rational  sense)  of  the  variables  In  PART.  If 
no  coefficient  of  this  type  exists,  zero  will  be  returned. 
RATCOEF  will  give  reasonable  answers  to  reasonable  requests,  and 
will  often  produce  reasonable  answers  to  poorly  stated  requests. 
Generally,  when  PART  Includes  a  or  a  "/",  results  may  seem 
odd.  (see  lines  D7,  D8,  DIO,  and  Dll  In  the  examples  to  follow). 
Since  EXP  is  rationally  simplified  before  It  is  examined,  coef¬ 
ficients  may  not  appear  quite  the  way  they  were  envisioned.  The 
effect  of  RATCOEF  should  be  clarified  by  the  following  examples. 

(Cl)  S: A*B*X**2+B*X+2*X+5@ 

2 

(DD  A  B  X  +  BX  +  2X+5 

(C2)  RATCOEF(S,X) 


(D2) 


B  ♦  2 


( C3)  RATC0EF(S, A*B)@ 
(D3) 

(C4)  RATC0EF(S, B)@ 


(04) 


AX  ♦  X 


( C5 )  RATC0EF(S, 2*X)@ 

B  ♦  2 

(05)  . 


2 
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(C6)  RATC0EF(5,B/2)@ 

2 

CD6)  2  A  X  ♦  2  X 

(C7)  RATCOEF( A*X+B*X+C, A+B)  @ 

(D7)  X 

(C8)  RATCOEF (3*A+2*B,A+B)@ 

( D8)  2 

(C9)  RATCOEF(S,-A)@ 

2 

( D9 )  -  B  X 

(CIO)  RATCOEF ((A*B*C)/D,  B/D ) @ 

(DIO)  A 

(Cll)  RATCOEF (3*A/D+A/D**2,  A/D**?)@ 
(Dll)  0 


Let  us  first  define  RATCOEF ( EXP,  PART)  where  EXP  is  a 

k 

polynomial  and  PART  has  the  form  v  for  v  a  variable,  k  a  number. 

This  case  is  clear:  we  expand  EXP  as  a  CRE,  and  pick  off  the 

k  k 

coefficent  of  v  .  If  there  is  no  occurrence  of  v  ,  the 

coefficent  is  0.  If  EXP  is  not  a  polynomial,  but  a  ratio  of 

polynomials,  then  we  must  make  a  decision  about  how  to  treat 

occurrences  of  v  in  the  denominator. 


Let  EXP  “num/denom,  where  num  ■  2a  v  .  If  the  coefficient 

I  i 

of  v  ,  namely  a  ,  is  zero  or  if  a  /denom  depends  on  any  variable 
i  i 

in  the  original  PART,  then  the  response  is  zero.  Otherwise  the 

response  is  a  /denom. 
i 
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RATCOEF  of  a  product  can  be  defined  recursively  as 

follows.  Consider  RATC0EF( EXP, PART) .  If  PART  - 

n  n  n 

1  2  k 

v  *v  *...*v  ,  then  RATCOEF ( EXP, PART)  = 

12  k 

n  n  n 

k  1  k-1 

RATCOEF ( RATCOEF ( EXP, v  ),v  ). 

k  1  k-1 

If  PART  *  A/B  then  RATCOEF ( EXP, PART)  ■  RATCOEF( EXP*B, A) . 

If  PART  «  -A,  RATC0EF( EXP, PART)  -  RATC0EF( -EXP, PART) . 

If  PART  B  ^  Aj  ^Possibly  after  removing  multipliers,  as 

above),  then  EXP  is  divided  by  PART  with  respect  to  the  main 
variable  in  PART,  if  the  quotient  depends  on  any  variable  In  the 
original  PART,  the  response  is  zero.  Otherwise  the  answer  is  the 
quotient. 

The  coefficient  produced  in  this  manner  may  depend,  in  the 
last  case,  on  the  ordering  of  the  variables  within  EXP.  For 

2  2 

example,  the  coefficient  of  (Y+Z)X  in  Z  X  +(Y+Z)X+A  is  clearly  1. 

The  similar  problem  of  finding  the  coefficient  of  XZ+XY  in 

2  2  2  2 
X  Z  +XZ+XY+A  yields  the  answer  0,  since  X  Z  +XY+XZ+A  divided  by 

2 

XZ+XY  is  XZ+1,  with  remainder  -X  YZ+A.  The  quotient  depends  on  X, 
and  thus  the  coefficient  is  taken  to  be  zero. 

This  illustrates  both  the  ability  of  the  user  to  ask  for 
coefficients  of  sums,  and  the  ability  of  RATCOEF  to  sometimes 
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answer  correctly.  We  could  have  defined  RATCOEF  only  for 
products/  but  It  seems  more  in  keeping  with  the  spirit  of  an 
interactive  system  to  avoid  such  restrictions  on  the  user.  Note 
that  if  the  user  were  disappointed  with  the  answer  0  to  the  above 
request/  first  executing  RATVARS(X)  would  correct  the  situation. 

In  summary#  RATCOEF  will  find  the  coefficient  of  PART  when 
PART  is  a  factor  of  the  expression/  or  of  some  part  of  the  ex¬ 
pression  such  that  the  other  factor  has  none  of  the  same 
var i abl es . 

The  returned  value  is  in  CRE  form. 

An  alternative  to  RATCOEF  is  available  in  situations  where 
its  generality  is  not  needed.  The  COEFF  command  can  operate  on 
CRE  forms  or  on  ordinary  MACSYMA  forms  which  have  been  expanded. 
COEFF ( EXP, VAR/ POWER)  will  extract  the  coefficient  of  VAR**P0WER 
(where  POWER  may  be  0)  from  EXP.  COEFF  returns  a  CRE  form  if  and 
only  if  it  is  given  a  CRE  form. 

3.4.  Simple  Extensions  to  Rational  Simplification 
FULLRATSIMP(EXP)  is  an  expanded  version  of  RATSIMP  which 
is  recursive  on  the  arguments  of  non-rctional  functions.  It  also 
removes  zero  exponents,  and  converts  forms  like  (x**y)**z  to 
x**(y*z).  Although  these  last  two  operations  are  generally 
performed  by  tie  simplification  program,  FULLRATSIMP  must 
repeatedly  simplify  the  resuits  of  such  transformations  until  no 
more  rational  simplifications  can  be  made.  FULLRATSIMP  is  no  more 
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time-consuming  than  RATSiMP  if  EXP  Is  an  algebraic  expression 
with  no  non-rat i onal  functions.  FULLRAT ( EXP)  Is  a  program  which 
operates  similarly,  but  allows  the  user  to  specify  a  varlist 
as  does  RAT. 

A  more  extensive  expansion  of  the  concept  of  global  simpli¬ 
fication  is  embodied  in  RADCAN.  While  FULLRATSIMP  does  not  apply 
any  identities  concerning  logs#  radicals,  and  non-numeric 
exponents,  RADCAN  does. 

RADCANC EXP)  converts  tne  expression  EXP  into  a  form  which 
is  canonical  over  a  large  class  of  expressions  and  a  given 
ordering  of  variables;  that  Is,  all  functionally  equivalent  forms 
are  mapped  into  a  unique  form.  For  a  somewhat  larger  class  of  ex¬ 
pressions,  RADCAN  produces  a  normal  form;  that  is,  all  forms 
equivalent  to  zero  are  mapped  into  zero.  For  purely  rational  ex¬ 
pressions,  RADCAN  is  no  more  time-consuming  than  RATSIMP  or 
FULLRATSIMP;  however,  for  more  general  expressions  including 
radicals,  logs,  and  non-integer  exponents,  RADCAN  can  be  quite 
expensive.  This  is  the  cost  of  exploring  certain  relationships 
among  the  components  of  the  expression  for  simplifications  based 
on  factoring  and  part i al -fract ion  expansions  of  exponents. 

A  description  of  the  method,  and  proofs  of  the  canonical 
properties  of  the  RADCAN  algorithm  are  discussed  in  chapter  4. 
Examples  should,  however,  give  a  rough  feel  for  the  capabilities 
of  RADCAN.  (|  always  refers  to  the  just-previously  displayed  ex¬ 
pression,  %E  is  the  base  of  the  natural  logarithms): 
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CC1)  SQRT ( 98) @ 

CD1)  SQRT ( 98 ) 

( C2)  RADCANC|)@ 

(D2)  7  SQRT ( 2 ) 

(C3)  (SQRT (X**2-1))/(SQRT(X-1))@ 


CD3) 


2 

SQRT ( X  -  1) 


SQRT ( X  -  1) 


CC4)  RADCANC|)@ 

(04)  SQRT(X  +  1) 

(C5)  ( LOGC A**C  2*X)  +  2*A**X  +  l))/( LOG( A**X+1) )@ 

2  X  X 

LOGC  A  ♦  2  A  +1) 

CDS)  . 


X 

LOGC  A  +  1) 


CC6)  RADCANC I )  @ 

(D6)  2 

CC7)  C%E**X-l)/C%E**CX/2)+l)@ 

X 


C  D  7 ) 


IE  -  1 


X/2 

IE  ♦  1 


C  C 8 )  RADCANC  %) @ 

X/2 

CD8)  *E  -  1 


3.5,  The  RATSUBST  Crationel  substitution)  Commands 

RAISUBST^  or  RATSUBSTnC A# B, C)  where  n  *  1,  2,  3/  4  is  a  set 
of  similar  commands  to  substitute  A  for  each  occurrence  of  B  in 
the  expression  C.  In  those  cases  where  it  is  clear  where  B 
occurs,  the  result  will  correspond  to  the  intuitive  notion  of 
subst i tu  t i on . 
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If  B  Is  an  atom/  occurrences  of  B  are  obvious.  The  action 

taken  is  simply  substitution  followed  by  simplification. 

If  B  is  a  quotient,  say  b  /b  ,  then  RATSUBSTn( A, B, C)  is 

1  2 

entirely  equivalent  to  RATSUBSTn( A*b  ,b  ,C). 

2  1 

I  *  B  is  a  product,  all  coefficients  of  powers  of  B  can  be 
detected  in  C  by  a  technique  similar  to  that  used  by  RATCOEF. 
Hearn  in  (20)  suggests  this  approach.)  If  B  is  a  sum,  we  must 
define  what  we  mean  by  an  occurrence  of  an  expression  B  in  a 
polynomial  expression  C.  (If  C  Is  not  a  polynomial,  we  can 
consider  its  numerator  and  denominator  separately.) 

i 

lf  c  =  Z.  S  B  ,  then  B  is  said  to  occur  in  C  with  coef- 
i  i 

ficient  S  and  exponent  1,  coefficient  S  and  exponent  2,  ..., 

1  2 

and  remainder  S  .  if  B  occurs  in  such  a  fashion  we  wish  to 
0 

replace  C  by  2.S  A  .  Unfortunately,  finite  power  series 
i  i 

expansions  for  an  expression  in  terms  of  a  non-atomic 

.  ,  2  2 
subexpression  are  not  unique.  For  example,  C  *  x  +3xy+y  has 

(among  others)  the  following  expansions  in  (x+y): 

2  1  0 


1.  l*(x+y)  + 

0* ( x+y) 

+  x*y*(x+y) 

2 

1 

0 

2.  l*(x+y)  + 

x*(  x+y) 

-  x  *(x+y) 

2 

1 

C 

3.  l*(x«y)  + 

y*(x+y) 

-  y  *(x+y) 
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What  is  needed  is  a  set  of  restrictions  on  the  coefficients 

S  so  that  the  expansion  is  unique  and  appropriate  to  the  problem 
i 

at  hand.  This  is  the  basic  problem  in  substitution  for  simpli¬ 
fication,  and  this  solution  is  based  on  a  set  of  heuristics  for 
achieving  what  appear  to  be,  in  some  instances,  more  desirable 
results  than  have  been  possible  in  the  past.  We  will  separate  out 
only  the  highest  power  of  B,  and  discuss  at  each  stage 

n 

(recursively  on  lower  powers  of  B)  the  situation  C  =  SB  +  r, 

where  r  contains  the  lower  order  terms. 

As  we  have  pointed  out  earlier  in  our  discussion  of 

RATCOEF ,  the  ordering  of  variables  is  sometimes  quite  critical. 

"Sum"-hood,  which  is  a  property  of  a  form,  not  of  a  f unct i or . 

sometimes  depends  on  ordering.  For  example,  xz+yx  is  a  sum,  but 

(z+y)x  is  (for  purposes  of  RATSUBST)  nat  a  sum,  but  a  product, 

although  the  two  expressions  are  functionally  equivalent. 

Let  B  be  a  polynomial  containing  variables  v  ,v  ,,.,,v  , 

12  k 

where  the  highest  power  of  each  v  is  m  .  For  all  but  condition  2 

i  i 

below,  the  only  restriction  on  r,  the  remainder  consisting  of 
lower  order  terms,  is  that  it  has  lower  degree  than  C  does  in 
some  particular  variable  (namely,  the  most  important  on  the 
varlist  that  is  also  in  B).  The  cond i t i ons  bel ow  are  embod i ed  in 
the  commands  RATSUBST1, 2,3,  and  4,  respectively.  Their  effects 
can  best  be  gauged  by  frequent  reference  to  the  examples  in 
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figure  3.2  following.  RATSUBST  (without  a  number)  is  a  quicker 
program  than  the  numbered  ones/  which  short-cuts  many  of  the 
(rarely  needed)  conversions  and  re-conversions  required  for 
strictly  following  all  the  conditions. 

Condi tions 

1.  The  highest  power  of  some  v  in  S  thr.t  appears  in  B  is  less 

i 

than  the  corresponding  m  . 

i 

2.  The  highest  power  of  each  v  in  S  that  appears  in  B  is  less 

i 

than  the  corresponding  m  ,  and  the  highest  power  of  each  v  in  r 

i  i 

that  appears  in  B  is  less  than  the  cor  responding  m  . 

3 .  S  i s  a  pol ynomi al . 

4.  S  contains  no  sum. 

The  value  of  n  ranges  from  the  highest  possible  (the  ratio 

of  the  highest  coefficient  of  some  v  in  C  which  is  also  present 

i 

in  B,  to  the  corresponding  maximum  coefficient  of  that  v  in  B, 

i 

namely  m  )  to  the  lowest  possible  (when  some  v  in  B  is  no  longer 
i  i 

present  in  C  to  a  power  as  high  as  it  is  in  B,  or  1.).  To  avoid 

the  possibility  of  looping,  occurrences  of  B  in  C  are  replaced, 

as  found,  by  a  special  dummy  variable,  which  is  subsequently 

replaced  by  A.  Cases  in  which  B  occurs  in  A  (probably  an  error  on 

the  user's  part)  or  where  simplification  of  C  results  in  new 
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occurrences  of  B  can  be  treated  with  repeated  calls  to  RATSUBST. 
This  can  be  easily  programmed  In  MACSYMA. 

If  C  contains  non-rational  functions,  substitution  proceeds 
on  the  arguments  of  the  non-ratlonal  functions,  recursively.  Thus 
A,  B,  and  C  need  not  be  rational  expressions. 

1/2  By  notlng  when  B  has  non-ratlonal  components  (e.  g.,  e  ,  or 
x  ),  RADCAN  can  be  called  on  B  and  C,  and  they  can  be  left  in  a 
special  expanded  format,  which  tends  to  reflect  more  clearly  the 
similarities  of  the  two  expressions.  Thus 
RATSUBST(A,E**X,E**(2*X))  Is  A**2. 

An  example  of  an  extension  to  the  RATSUBST  framework  might 
serve  to  illustrate  its  generality.  If  there  is  a  canonical 
ordering  on  all  expressions  submitted  to  RATSUBST,  and  on  all 
intermediate  expressions,  then  a  RATSUBST5  could  be  programmed 
with  the  following  condition: 
n 

J».  SA  ♦  r  has  a  lower  canonical  order  ("is  simpler")  than  sb"  ♦ 
r . 

By  using  the  RATSUBST  commands  selectively,  such  substitu- 
2  2 

tions  as  sin  (x)  ♦  cos  (x)  — >  1  can  be  performed  more  nearly  in 
the  sense  in  which  they  are  intended.  If  one  RATSUBST  command 
does  not  do  the  job,  perhaps  another  will. 


104 


'■>.6.  The  SOLVE  Program 

The  SOLVE  command  in  MACSYMA  uses  several  techniques  for 
solving  for  a  given  variable  in  an  equation.  Each  of  these 
techniques  is  open  to  extension  in  a  straightforward  manner.  The 
roots  and  their  multiplicities  are  available  to  other  programs, 
and  are  used  as  building  blocks  for  more  complicated  facilities, 
such  as  contour  integration. 

The  format  of  the  SOLVE  command  is: 

SOLVE(equat i on.  variable  Ha 

where  the  equation  may  also  be  an  expression  (which  is  assumed  to 
be  set  equal  to  zero),  or  a  set  of  polynomial  equations  linear  in 
some  set  of  variables.  This  last  case  is  a  straight-forward 
problem  in  Gaussian  elimination,  and  will  not  be  discussed 
further  here. 

S0LVE( E, X)  puts  its  first  argument  E,  in  radical  canonical 
form,  and  attempts  to  factor  it  with  respect  to  the  var i able  X, 
and  all  non-rational  functions  in  E  containing  X.  Each  factor  is 
examined  for  being  linear,  quadratic,  cubic,  or  biquadratic  with 
respect  to  X  and  the  non-rational  functions  containing  it.  If  the 
factor  is  of  degree  five  or  more,  then  it  is  considered 

n 

unsol vable  unless  it  is  of  the  form  a(F(X))  ♦  b  in  which  case 

the  n  nth  roots  of  a/b  ore  generated,  and  the  n  equations  F(x)- 
1/n 

(a/b)  =  0  are  solved.  Any  remaining  unsolved  factors  and  their 

multiplicities  are  put  on  a  list  which  is  returned  along  with  the 


roots. 
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Linear  terms  of  the  form  F(X)-C  are  examined  to  see  if  C , 
the  constant  term,  is  actually  free  of  elements  containing  X;  If 
so,  USOLVE  is  called.  Otherwise  the  term  is  added  to  the  list  of 
unsolved  factors.  USOLVE  knows  the  inverses  of  S i N,  COS,  ASIN, 

ACOS,  TAN,  ATAN,  LOG,  etc.  and  powers  of  e.  it  could  be  extended 

to  other  functions.  Once  the  inverse  has  been  applied,  a  new 
equation  results,  it  may  be  of  the  form  X  -  FiNVERSE(C),  in  which 

case  the  term  has  been  solved,  or  It  may  be  of  the  form  G(X)  = 

FINVERSE(C),  in  which  case  SOLVE  is  called  again.  This  recursive 
algorithm  allows  for  solution  of,  for  example,  SIN(C0S(X))  «  0 
for  X. 

The  quadratic  (cubic,  biquadratic)  formula  is  applied  to 
quadratic  (etc.)  factors,  and  the  same  sort  of  recursive 
treatment  as  described  above  Is  used  if  the  equation  is,  for 
example,  quadratic  in  SiN(X)  Instead  of  X. 

The  simplification  done  by  the  quadratic  (etc.)  routines  is 
of  some  interest,  in  that  the  roots  in  the  formulae  are  simpli¬ 
fied  by  a  special  program  (SiMPNRT)  which  takes  out  perfect  n*k 
powers  of  a  kth  root.  (i.e.  even  powers  In  a  square  root, 
mul t iples-of-three  powers  in  a  cube  root,  etc.)  Thus  SQRT(8)  Is 
simplified  to  2*SQRT(2).  SIMPNRT  calculates  a  square-free 
factorization  of  the  radicand,  and  takes  appropriate  multiple 
factors,  if  any,  outside  the  radical. 

The  following  examples  illustrate  the  capabilities  of 

SOLVE: 
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(Cl)  SOLVE(Y**(2*X)-3*Y**X+2»0/X)G 
SOLUTION 


(El) 

X  ■  0 

(E2) 

LOG( 2) 
x  - - 

LOG(Y) 

(02) 

(El/ E2) 

( C3) 

A:  X**2-12*X+3U 

(D3) 

2 

X  -  12  X  ♦  3 

(  C4 ) 

SOLVE(SIN(A)**2-5*SIN(A)*3/X)Q 

SOLUTION 

(  E4 ) 

5  SQRT (13) 

X  ■  6  -  SQRT ( ARCS  1  N( . . )  ♦  33) 

2  2 

( E5) 

5  SQRT( 13) 

X  ■  SQRT ( ARCS  1  N( . )  ♦  33)  ♦  6 

2  2 

(EC) 

SQRT(13)  5 

X  «  6  -  SQRT ( ARCS  1 N( . ♦  -)  ♦  33) 

2  2 

( E7) 

SQRT( 13)  5 

X  -  SQRT( ARCS  1  N( . ♦  -)  ♦  33)  ♦  6 

2  2 


(D7) 


<E4,E5,EG,E7) 


107 


<C8)  SOLVE (ARCS  I N( COS( 3*X ) )*(F(X)-1),X)Q 
SOLUTION 

ARCCOS(O) 

(E8)  X  . 

3 

THE  ROOTS  OF 

(E9)  F( X)  -  1 

(U9)  ( E8, E9 ) 

(CIO)  SOLVE(5**X«125,X)(J 
(010)  X-3 

Note  that  SOLVE  has  taken  advantage  of  radical  approaches 
but  is  still  able  to  step  back  and  treat  fairly  general  expres¬ 
sions.  In  order  to  use  the  "radical"  polynomial  factoring  pro- 
gram,  it  uses  RADCAN  to  expand  unlikely-looking  expressions  into 

2X  X 

polynomials.  Thus  the  expression  Y  -3Y  +2  in  Cl  is  expanded  into 

X  Xlog(Y) 

a  polynomial  in  Z,  where  Z«Y  (actually  Z=e  ),  which  is 

then  factored  into  (Z-l)*(Z-2),  By  setting  each  of  these  factors 
equal  to  zero,  the  following  sequence  of  steps  is  followed: 
Xlog(Y) 

e  -1  3  0  is  converted  by  USOLVE  to 

Xlog(Y)  b  log(l)  which  the  simplifier  changes  to 
Xlog(Y)  b  o. 

SOLVE  is  called  recursively,  and  factors  this;  SOLVE 
throws  out  the  log(Y)  factor  since  it  does  not  depend  on  X,  and 
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the  factor  "X"  is  recognized  as  a  linear  expression  of  the  form 
aX+b  where  a-1  and  b-0,  which  has  solution  X--a/b,  or  in  this 
case,  X-0.  The  other  root  is  handled  in  an  analogous  fashion. 

3.7  Conclusions 

By  using  several  distinct  approaches  to  attack  different 
phases  of  the  same  problem,  particularly  powerful  algorithms  can 
be  obtained.  Although  ad  procedures  can,  in  some 
circumstances,  yield  similar  results  in  other  systems  for  alge¬ 
braic  manipulation,  MACSYMA's  SOLVE,  RATSUBST,  RATCOEF,  and  FULL- 

RATS  IMP  commands  provide  a  generality  and  power  not  available 
el sewhere. 

These  foundation  blocks  allow  the  building  of  new 
facilities.  SOLVE  is  used  by  programs  which  find  limits,  compute 
definite  integrals,  and  expand  functions  in  power  series. 

RATCOEF  is  used  by  the  semantic  pattern  matching  subsystem. 
FULLRATSIMP  is  used  by  RADCAN,  and  RADCAN,  in  turn  is  used  by 
SOLVE.  RADCAN,  furthermore,  can  be  used  as  the  basis  for 
implementing  the  Risch  (41)  Integration  algorithm. 
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Chapter  4  -  Simplification  of  Radical  Expressions 

4.1.  Introduction 

The  simplification  of  algebraic  expressions  is  a  many- 
faceted  problem.  On  one  hand,  all  of  the  work  in  simplification 
(and  algebraic  manipulation  in  general)  is  circumscribed  by  the 
work  of  Richardson  (39),  which  shows  that  for  a  sufficiently 
large  class  of  expressions  the  question  of  zero-equivalence  is 
undecidable.  Furthermore,  some  researchers  (e.g.  Fenichel  (16), 
Moses  (34),  (35))  argue  that  (regardless  of  computability)  the 
concept  of  simplicity  has  no  generally  acceptable  meaning.  On  the 
other  hand.  Brown  (3),  Caviness  (5),  (6>  and  others  show  that 
within  certain  classes  of  expressions  the  rigorous  notions  of 
canonical  forms  and  zero-equivalence  tests  can  serve  as  useful 
measures  of  simplicity.  For  a  survey  of  these  and  other  atti¬ 
tudes  and  achievements  in  algebraic  simplification,  see  Moses 
(34). 

The  importance  of  the  simplification  problem  in  algebraic 
manipulation  is  quite  basic:  A  "simplified"  expression  generally 
exhibits  its  most  significant  properties  in  a  systematic  fashion. 
This  can  make  mechanical  (or  human)  processing  of  the  expression 
much  easier. 

This  chapter  discusses  simplification  algorithms  for  the 
class  of  radical  expressions.  These  are,  roughly  speaking,  ratios 
of  multivariate  polynomials,  some  of  whose  "variables"  are  nth 
roots  of  polynomials.  These  expressions  commonly  occur  in 
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representing  roots  of  algebraic  equations  in  several  variables, 
and  are  rarely  treated  adequately  in  algebraic  manipulation 
systems.  The  only  current  alternative  to  the  treatment  we 
provide  in  MACSYMA  (and  describe  here)  is  a  computationally 
impractical  procedure  suggested  by  Caviness  in  (5). 

In  the  following  sections  we  will  proceed  to  define  the 
problem  of  simplification  of  radical  expressions  in  more  exact 
terms  and  contrast  our  approach  with  that  of  others  who  have  had 
similar  goals.  In  sections  4.2  and  4.3  we  discuss  basic  concepts 
and  define  the  class  of  radical  expressions  more  precisely.  In 
section  4.4  we  survey  previous  algebraic  approaches  to  radical 
simplification  and  a  promising  alternative,  zero-equivalence 
testing. 

Sections  4.5  and  4.6  discuss  the  specific  methods  we 
developed  for  MACSYMA.  Section  4.7  proves  some  properties  of  the 
simplified  form;  4.8  discusses  the  canonical  form  implications  of 
this  work;  4.9  points  to  other  related  efforts  in  MACSYMA,  and 
4.10  summarizes  its  usefulness. 

4.2.  Basic  Concepts 

Following  Caviness  (6),  to  be  given  a  class  q±  express i ons 
^  means  to  be  given  rules,  such  as  a  Backus-Naur  Form  (BNF) 
grammar,  for  determining  the  well-formed  expressions  in  the 
class.  The  expressions  must  be  formed  from  a  finite  set  of 
atomic  symbols,  a  subset  of  which  must  be  designated  as  var¬ 
iables.  A  member  of  £  not  containing  any  variables  is  a  constant . 
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Expressions  are  Interpreted  as  functions  over  the  domain  of 
constants. 

If  R  and  S  are  members  of  an  expression  class  <£,  R  is  said 
to  be  i dent i cal  to  S  if  R  and  S  are  the  same  string  of  atomic 
symbols.  This  relation  is  denoted  by  R  ;  S.  R  and  S  are  said  to 
be  functionally  equivalent  or  simply  equivalent,  if  for  all 
assignments  of  values  in  &  to  their  variables  for  which  they  are 
defined,  they  are  equal.  This  relation  is  denoted  by  R  ■  S.  Of 
course  R  »  S  implies  R  =  S. 

One  concept  related  to  simplicity  which  is  of  particular 
usefulness  is  that  of  a  canonical  form. 

Def ini tion  4.2-1  A  canonical  form  algorithm  f  for  a  class  of 
expressions  4  is  a  mapping  from  &  into  £  such  that  for  all  R,  S 
i  n  £, 

(i)  f (R)  =  R 

C i i )  R  *  S  ==>  f (R)  -  f (S) 

Dgf 1 n i tion  4.2-2  A  zero-eau i valence  test  algorithm  f  for  a  class 
of  expressions  ^  is  a  function  from  £  into  £  such  that  for  all  R, 
Sin  t, 

fCR)  *  0  <-•>  R  *  0 

The  constant  problem  consists  of  determining  the  zero-equivalence 
of  an  expression  containing  no  variables. 
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A  third  concept/  that  of  a  normal  form#  is  used  by  Cavlness. 

Ofi-Cinition  4.2-3  A  normal  form  algorithm  f  has  the  same  strong 
property  of  the  zero  equivalence  test  algorithm#  but  has  the 
additional  properties 

(i)  f(R)  *  R  (whether  or  not  R  °  0) 

(  ti)  f(R)  fits  a  "pattern." 

This  pattern  concept  is  not  generally  defined  but  can  be 
clarified  in  a  particular  situation.  For  example#  Brown's 
"simplified"  form  for  rational  exponential  expressions  (3)  is 
norma  1 . 

A  more  useful  concept  than  the  normal  form  is  that  of  a 
regular  elementary  (or  just  regular)  form  as  introduced  by  Rlsch 
(40). 

f 

definition  4.2-4  If  6  ■  e  or  log(f)  for  f  t  £  and  is  transcen¬ 
dental  over  £#  e  is  said  to  be  a  monomial  over  £.  If  0  is  a  root 
of  a  polynomial  with  coofficents  in  &  Irreducible  over  &  and  of 
degree  d(0)  at  least  2#  then  0  is  said  to  be  non-tr i vial 

algebraic  over  6.  Let  *3=  £(0  #  ...  ,0  ),  that  is#  £  with  n 

1  n 

normal  algebraic  extensions.  is  regular  elementary  over  <£  iff 

each  0K  is  a  monomial  or  is  non-trivial  algebraic  over 

£(0  #  ...  #0  ).  An  expression  g  *  d  is  reaul ar  e 1 ementar v  l f 

1  k-l 

the  degree  of  g  in  any  algebraic  0  is  less  than  the  defining 
degree  of  0,  d(0) . 
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Clearly  if  g  contains  6  to  some  higher  degree  than  d(0)/ 
reductions  can  be  made  to  remove  this  condition.  Any  g  e  fits 
the  impiied  "pattern"  of  a  rational  function  (ratio  of  two  poly¬ 
nomials)  because  any  expression  is  rational  once  a  regular  field 
description  is  found.  Thus  the  vagueness  of  the  normal  form  is 
removed. 

Section  4.8  returns  to  exponential  and  logarithmic 
monomials  briefly/  but  for  the  bulk  of  this  chapter  we  will  be 
concerned  only  with  the  non-trivial  algebraic  extensions. 

A  class  of  expressions  is  called  a  canonical  (normal/ 
regular)  class  or  is  said  to  possess  a  canonical  (normal/ 
regular)  form  if  there  exists  a  canonical  (normal/  regular)  form 
algorithm  for  it.  it  is  conventional  to  assume  that  if  R  =  0, 
then  f(R)  *  0. 

4.3.  Radical  Polynomials  and  Expressions 

Radical  polynomials/  P,  are  formed  from 

( i )  the  integers 

(ii)  the  variables  x  /  x  ,  .../  x  (collectively  called  X) 

1  2  N 

(iii)  the  operations  of  addition/  subtraction/  multiplication 

(iv)  the  un-nested  operation  of  exponentiation  to  a 
positive  rational  number. 

Radical  expressions  are  formed  from  radical  polynomials  with 
the  added  operation  of  division. 
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Examples  of  radical  expressions  are 

1/2 

(-X  +  4 ) 

1  1 

-  and  - 

1/3  5/4  2/5 

x  +  x  (x  /2  +  x  ) 

12  3  4 

The  expression 

1/2  1/2 
(x  +  3  ) 

is  not  in  the  class  P  because  of  the  nested  exponents. 

This  definition  is  a  slight  generalization  of  one  given 
by  Caviness  (5)  in  that  it  allows  more  than  one  variable. 

The  interpretation  given  to  radical  expressions  is  one 
which  we  believe  corresponds,  in  its  impl ications,  to  the  miost 
common  valid  usage.  As  does  Caviness,  we  interpret  radical  ex¬ 
pressions  as  algebraic  functions:  For  each  expression  E*P  , 
there  must  exist  an  irreducible  polynomial  P(z,X)  such  that 
P(E,X)  •  0.  Caviness  notes  the  necessity  of  simplifying 
3/2  1/2  2  1/2 

(x+1)  (x-1)  -  (x  -  1)  to  0  in  spite  of  the  following 

si tuat ion: 


I  f  we 


let  y  be  a  root  of 
1 

y  be  a  root  of 
2 

y  be  a  root  of 
3 


2 


y 

2 

y 

2 


y 


x+l 


x  - 1 
2 

x  -1 


then  y  y 
1  2 


2  1/2 

y  can  just  as  easily  be  +2  (x  -1) 

3 


as  0.  Perhaps  a 
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complete*  answer  would  include  all  these  possibilities.  Any 
interpretation  "cons  i  stent:'  (but  unspecified  by  Caviness)  should 
produce  0,  since  admitting  the  other  possibilities  is  tantamount 
to  declaring  all  algebraic  extensions  transcendental  over  the 
base  field  (and  therefore  subject  to  no  simplifications  at  all). 
Caviness  requires  that  some  branch  of  multiple  valued  roots  be 

2  1/2 

chosen.  Thus  (x  )  is  either  x  or  -x,  depending  on  the 

branch  of  the  square  root  chosen.  We  differ  from  Caviness  on 
this  point:  a  iiajLt icul ar  branch,  the  positive  real  branch,  to  be 
defined  shortly,  will  be  automat i cal  1 y  chosen  as  the  interpreta¬ 
tion  of  the  radical . 

In  general,  single-valued  branches  of  radicals  are  not 
analytic  everywhere,  and  hence  their  domains  must  be  suitably 
restricted  in  either  Caviness1  ££  our  interpretation. 

We  now  define  the  particular  interpretation  of  radicals 
which  we  use. 

DAfjjli  tion  4.3-1  A  polynomial  p(X)  is  said  to  be  positive  if  its 
leading  (integer)  coefficient  is  positive,  when  p  is  written  in 
some  canonical  form.  In  such  a  case  we  shall  write  p  2  0. 

flgf  i  n  i  1 1  fl/l  4.3-2  A  polynomial  or  integer  p  is  said  to  be  souarp- 
f ree  if  it  has  no  repeated  factors  (or  roots). 

If  P  is  a  positive  square-free  polynomial  and  m  is  a 

i/m 

positive  integer,  then  p  has  a  dps i t i ve  rea 1  i nterpretat ion 
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(£B1) .  All  other  instances  of  radicals  will  be  reduced  to  this 
case  in  defining  thel r  interpretations. 

fit-fin  ?  t  i  Qn  4.3-3  if  p  is  a  positive  square-free  polynomial  and  m 

1/m 

is  a  positive  integer,  then  p  has  the  positive  real  interpre¬ 
tation  ( PR  I )  defined  as  follows: 

1/m 

case  1:  p  is  a  positive  square-free  integer,  p  is  interpreted 
as  the  positive  real  mth  root  of  p. 
case  2:  p  is  a  polynomial  in  one  variable,  say  x.  The  coefficient 
of  the  leading  term  in  p(x)  is  a  positive  integer,  so  that 
there  exists  a  real  number  l  such  that  for  all  %  >  L,  p(%)  is  a 

i/m 

positive  real  number.  By  case  1,  for  each  %,  (p(%))  has  a 

1/m 

PRi .  The  PR i  for  (p(x))  is  then  this  branch  of  the  solution 
m 

to  z  -p(x)  *»  0  which  has  positive  real  values  for  x  >  L. 
case  3:  M-variable  polynomial  (M  >  1).  Assume  a  recursive  poly¬ 
nomial  representation  as  in  chapter  3.  it  is  possible  to  fix 
values  for  all  but  the  main  variable,  say  x,  such  that  the 
coefficient  of  the  leading  term  in  x  is  positive.  Then  a  PRi 
for  p(x)  is  defined  as  in  care  2.  For  example,  consider  the  3- 
variable  polynomial 

2  2 

p( x, y, z )  =  (y  -  (z  +  1)  y)  x  -  3  x  y  ♦  2 
choose  z  =  0  (arbitrary) 

2 

choose  y  =  2  to  make  y  -  (z  ♦  1)  y  positive  (namely  2) 
then  for  x  >  2,  p  has  a  PRI. 
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Now  let  us  define  interpretations  for  more  complicated 
radicals.  We  can  assume  that  any  radicand  is  at  worst  a  rational 
expression  p(X)/q(X)  where  p(X)  and  q(X)  are  relatively  prime 
polynomials  in  canonical  form/  and  q(x)  is  positive  and  nonzero. 

n/m 

(p(X)/q(X))  is  interpreted  as  the  ratio  of  the  interpre- 
n/m  n/m 

tat  ions  of  p(X)  and  qlX)  .  Thus  all  radicandscan  be  assumed 
to  be  polynomials. 

n/m 

If  p(X)  is  not  positive/  p(X)  is  interpreted  through  the 

i  rr/m  n 

use  of  a  primitive  root  of  unity  to  -  e  as  (w  )  times  the 

2m  2m 

n/m 

interpretation  of  (-p(X))  .  Thus  all  radicands  can  be  assumed 

to  be  positive. 

If  p( X)  is  positive  but  not  square-free/  It  is  easy  to 
prove  that  p(X)  may  be  factored  into  positive  square-free 


factors.  Thus  if 


k  i 

p(X)  =  T[  (p  (X))  / 
i«l  i 


the  interpretation  of  p(X)  is  the  product  of  the  interpreta¬ 


tions  of 


(P  (X)) 
I 


i  n/m 


for  i  =  1,  .../  k.  Thus  all  radicands  can  be  assumed  to  be 
square-free. 


If  n  2  i»/  then  for  n  *  q  m  ♦  r,  for  0  i  r  <  m#  p(X) 
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interpreted  as  the  product  of  p(X)  and  the  interpretation  of 
r/m 

PtX)  .  Thus  we  can  assume  n  <  m. 

If  n/m  is  not  in  lowest  terms,  it  can  be  converted  to 
lowest  terms.  Thus  we  may  assume  the  greatest  common  divisor  of 
n  and  m  is  1 . 

n/m 

I f  n  *  0,  p(X)  is  interpreted  as  1. 
n/m 

If  n  >  1,  p( X)  is  interpreted  as  the  nth  power  of  the 

1/m 

interpretation  of  p(X)  .  Thus  ail  interpretations  are  based  on 
definition  4.3-3. 

2  1/2 

According  to  the  PRI  interpretation,  (x  )  means  x  and 
1/2  1/2 

(20)  means  25  ,  a  positive  number.  We  believe  this 

corresponds  to  the  most  common  usage. 

We  should  point  out  that  some  of  the  transformations  used 
for  the  algorithms  to  follow  are  the  basis  for  innumerable  false 
"proofs"  These  proofs  are  generally  based  on  inconsistent  inter¬ 
pretation  of  radicals,  and  will  not  occur  in  our  usage.  For 
example,  using 

b  b  c  c  c  c 

a  *  a  a  and  (ab)  *  a  b 

we  can  "prove" 

1  1/2  1/2  1/2  1/2  1/2  1/2  1/2 
1  r  1  =  1  1  -1  (-1)  (-1)  «  (-1)  (-1) 

1 

»  (-1)  =  -1. 
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Since  MACSYMA  will  impose  positive  real  interpretations  on 
radicals,  it  will  not  factor  1  into  (-D(-l)  and  fallacies  of 
this  sort  will  not  occur. 


It  is  interesting  to  compare  our  interpretation  of  radicals 
with  one  which  is,  some  would  argue,  most  common,  namely,  that 

2  1/2 

t  e  express i oi  (x  )  means  |x|.  For  example,  the  modulus  of  c  - 


a+bi  is  written  as 


2  1/2 
b  ) 


we  are  left  with 


L  2  1/2 

the  convention  that  lal  -  (a  )  .  since  this  holds  only  when  a 
assumes  real  values,  and  the  square  is  computed  before  the  square 
root,  the  usage  is,  in  fact,  consistent  with  a  positive  real 


.  „  1/2  1/2 

interpretation.  In  general  x  meaning  |x  |  is  restricted  to 

the  domain  of  non-negative  real  x. 


In  summary,  there  are  (at  least)  three  interpretations  for 
radical s. 

1.  Caviness',  which  does  not  choose  a  branch  of  the  algebraic 
f unct i on; 

2.  Ours,  which  chooses  the  PR  I ; 

3.  The  "common"  square  root  which  implies  absolute  value  with 
restricted  domain. 

The  last  two  are  equivalent  on  a  restricted  domain,  and  the 
first  two  are  equivalent  up  to  the  choice  of  a  branch. 
Computationally,  interpretation  (2)  has  a  distinct  advantage  over 
either  of  the  others  in  that  it  is  consistent  over  a  larger 
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domain  than  (3),  and  does  not  unnecessarily  involve  arbitrary 
roots  of  unity  as  in  (1). 

4.4.  Comparisons  with  Previous  Work  on  Radical  Expressions 
4.4.1  Algebraic  approaches 

Caviness  proves  in  (5)  that  for  an  expression  E  til,  "E  ■ 
0"  is  decidable.  Unfortunately,  the  application  of  his 
constructive  proof  relies  on  an  impractical  (and  largely 
unnecessary)  computation.  The  problem  lies  in  the  difficult  task 
of  factoring  over  algebraic  extensions  of  a  polynomial  ring. 
Caviness  points  out  that  the  need  for  factoring  is  a  result  of 
the  lack  of  i r reduc i b i 1 i ty  criteria  for  the  radical  expressions. 
He  develops  a  few;  we  extend  his  results  and  show  that  satis¬ 
factory  results  can  generally  be  obtained  without  any  factoring. 

The  results  here  appear  to  conform  more  closely  to  intu¬ 
itive  notions  of  s i mpl i f ' cat  ion  than  does  Caviness'.  More 
important,  they  are  far  more  easily  computable,  since  the  only 
calculation  needed  is  that  of  the  greatest  common  divisor  (gcd) 
of  multivariate  polynomials  with  integer  coefficients. 

The  difficulty  in  Caviness'  approach,  from  a  practical 
standpoint,  is  his  interpretation  of  radicals  as  written  in  an 
expression.  His  approach  can  be  most  easily  seen  in  Van  der 
i.aerden  (47),  section  36.  Briefly,  given  any  finite  number  of 
radicals,  an  algebraic  extension  to  the  field  of  rational 
expressions  may  be  constructed  to  which  all  the  radicals  belong. 
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Each  expression  in  this  field  will  have  a  unique  representation 
within  the  field.  The  construction  of  this  field  takes  a  finite 
number  of  steps.  Given  a  radical  expression  E,  it  is  only 
necessary  to  explicitly  construct  a  suitable  field  which  contains 
E/  and  find  the  unique  representation  of  E  in  that  field.  This 
representation  can  always  be  found  in  a  finite  (but  possibly 
large)  number  of  steps.  This  does  not  produce  a  canonical  form 
since  there  are  an  infinite  number  of  fields  which  will  contain 
l,  and  the  representation  of  E  in  the  different  fields  may 
differ.  However,  given  two  non-identical  equivalent  expressions, 
a  field  may  be  constructed  which  contains  them  both,  and  in  which 
they  are  identically  represented. 

An  unpublished  report  by  S.  L.  Kleiman  (25)  proposes  a 
canonical  form  for  rational  expressions  in  several  algebraically 

2 

dependent  variables  (e.g,  f(x,y)  where  y  +x"l).  The  procedures  he 
suggests  have  never  been  implemented,  nor  would  they  be  computa¬ 
tionally  efficient;  nevertheless#  his  discussion  of  the 
problems  involved  is  quite  thorough.  He  avoids  the  question  of 
interpretation  of  radicals  by  introducing  new  variables  which 
satisfy  certain  polynomial  equations. 

By  contrast,  our  approach  (by  applying  i r reduc i b i 1 i ty 
criteria  and  simplifications)  is  to  produce  a  field  which  allows 
all  permiss'  le  simplification  to  be  performed.  Many,  but  not 
all  expressions  are  mapped  into  canonical  forms  by  this  approach. 
Those  not  in  canonical  form  are  easily  distinguishable  from  the 
others  by  the  presence  of  roots  of  -1. 
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4.U. 2  Zero-equivalence  tests 

it  has  been  shown  by  Richardson  (39)  and  Johnson  (24)  that 
zero-equivalence  tests  for  the  class  of  expressions  treated  here 
(and  other,  larger  classes)  can  be  reduced  to  the  "constant" 
problem;  that  is,  all  references  to  variables  can  be  removed  in 
determining  zero-equivalence,  assuming  the  expressions  are 
totally  defined  over  the  domain  of  interest.  The  constant 
problem  is  non-trivial,  since  very  little  is  known  about  such 

e  1/4 

specific  constants  as  e+tr  or  e  ;  also  if  (-1)  stands  for  a 

in/  4 

primitive  fourth  root  of  -1,  for  example,  e 

1/4  3/4  1/2 

(4. 4. 2-1)  (-1)  -  (-1)  -  2 

is  a  constant  which  is  0,  but  not  obviously  so.  The  constant 
problem  does  not  concern  us  here  because  it  is  decidable  for 
adical  constants  by  methods  used  by  Caviness,  while  using  our 
interpretat ion  of  radicals  it  only  crops  up  with  roots  of  -1. 

We  discuss  zero-equivalence  tests  in  some  detail  because 
they  serve,  in  some  instances,  as  a  potentially  very  powerful 
tool  in  simplification,  in  some  cases  decisions  as  to  zero- 
equivalence  may  be  all  that  is  needed.  Secondly,  given  a  zero- 
equivalence  test,  we  can  produce  a  canonical  form  algorithm  in 
the  following  way:  Assume  we  wish  to  find  a  canonical  form 
algorithm  for  a  class  of  expressions  but  only  have  a  zero- 
equivalence  test  over  that  class.  We  can  produce,  in  lexico- 
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graphic  order,  all  legal  members  of  the  class  (say,  in  size 
place,  up  to  and  including  the  length  of  the  expression  f  under 
consideration).  The  first  generated  expression  g,  such  that  f-g 
is  0,  is  the  canonical  form.  Although  this  is  clearly  unsatls- 
factory  as  a  practical  computational  approach,  it  does  provide 
some  theoretical  unity  to  the  concepts.  Furthermore,  research 
along  the  lines  of  the  approach  Illustrated  below  promises  to 
provide  especially  useful  insight  into  the  ways  expressions  can 
combine.  This  is  particularly  relevant  for  classes  of  expression 
much  larger  than  R. 

Let  us  Illustrate  the  approach  of  Johnson's  (24)  zero- 

B 

equivalence  test.  Let  &  consist  of  expressions  of  the  form  A  ,  A 
a  rational  function  in  one  variable,  x,  and  B  a  rational  number. 
Let  &  consist  of  products  of  elements  of  Radical  polynomials 
are  sums  of  elements  of  t  .  Define  the  function  L(u)  *  (du/dx)/u 
for  u  4  6  ,  Any  element  u  of  is  called  an  e 1 genvector  (of  the 
derivative  operator)  whose  eigenvalue  is  L(u).  Eigenvalues  are 
always  rational  functions  of  x,  since 


L(A*B) 

*  L( A)  + 

L(B) 

L( A/B) 

«  L( A)  - 

L(B) 

B 

L( A  )  • 

■  B*  L(  A) 

B  a 

rat i onal 

number 

L(x)  « 

1/x 

L(B)  = 

0 

B  a 

rat i onal 

number 

L( A+B)  *  (dA/dx  ♦  dB/dx)/(A  ♦  B) 


A,  B  rational 
functions  of  x. 
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Since  we  can  always  decide  whether  or  not  a  rational  func~ 
t ion  of  x  is  zero/  we  can  always  tell  whether  or  not  L(u)  for 
u  is  zero.  The  basis  for  the  algorithm  is  the  fact  that  for 

u  equivalent  to  a  constant,  L(u)  ■  0. 

Suppose  we  can  decide  if  a  constant  is  zero.  Assume  we 

have  a  set  of  eigenvectors  u  ,  !■  1,  . ..,  n  (and  have  calculated 

t 

their  eigenvalues  by  the  above  rules).  We  may  decide  if 
n 

s  -  £  U 

1-1  i 

is  zero  as  fol lows: 

STEP  1:  If  n  ■  1  and  L(u  )  is  not  zero,  S  f  0.  Otherwise  S  is  a 

1 

constant.  By  assumption  we  can  decide  if  the  constant  is 
zero.  Return. 

STEP  2:  if  n  >  1  then  consider 

T  *  2Z  u  /u  . 

1-1  i  n 

T  is  a  sum  of  eigenvectors  (whose  eigenvalues  are  known)  and 
whose  last  term  is  1.  Test  T  for  being  equivalent  to  zero 
(see  below). 

STEP  3i  if  T  f  0,  S  t  0.  Return. 

STEP  4:  if  T  *  0,  then  K  -  S/u  is  a  constant.  By  assumption  we 

n 

can  test  if  a  constant  is  zero.  \fKJ0,  then  S  »*  0.  if 
K  ■  0,  S  ■  0.  Return. 

We  must  now  explain  step  2.  Consider 

n-1 

T  B  ZZ  u  /u  ♦  1 
i-1  i  n 

If  any  of  the  eigenvalues  of  u  /u  ,  i  «  1,  ...,  n-l  are  zero, 

i  n 
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delete  them  from  T.  (The  eigenvalue  of  1  Is  0,  and  so  1  Is 
deleted.)  if  all  the  eigenvalues  are  zero,  T  ■  0.  Otherwise 
T  Is  a  sum  of  at  most  n-1  eigenvectors  (with  known  eigen¬ 
values)  so  this  algorithm  can  be  applied  recursively  to 
determine  whether  or  not  T  «  0. 

An  Example: 

Cons  1 der 

1/2  1/2 
S  *  2  x  -  ( 4x) 

The  eigenvalues  for  2  x^  and  -(4x)1/2  are  the  same,  namely 
l/( 2x) .  in  step  2  of  the  algorithm  we  set 

1/2 

2  x 

T  *  -------  +  i 

1/2 

(4  x) 

T.  ,  ,  1/2  1/2 

The  eigenvalue  for  (2  *  >/Ux)  Is  1/(2*)  -  1/(2*)  .  0.  This 

Implies  that  T  (and  thus  S/u  )  Is  a  constant.  The  particular 

n 

constant  value  of  S/u  must  be  determined  by  other  means.  Such 

n 

means  should  reveal  that  S/u  Is  In  fact  0. 

n 

Several  Important  facts  should  be  noted.  First,  the 
problem  of  deciding  when  a  constant  Is  zero  Is  not  solved. 

Second,  If  an  expression  is  nol  zero,  a  "simplified"  equivalent 
expression  is  not  generally  produced.  Third,  the  class  of  eigen¬ 
vectors  can  be  extended  to  other  expressions  (e.g.  eA,  for  A 
rat ional  in  x) . 
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Richardson's  (39)  scheme/  which  is  somewhat  more  compli¬ 
cated/  does,  however/  allow  for  composition  of  functions.  His 
method  has  been  extended  to  a  large  class  of  functions  defined  by 
first  order  non-linear  differential  equations  by  Moses/ 
Rothschild/  and  Schroeppel  (36). 

Zero-equivalence  tests#  although  an  area  of  theoretical 
importance/  cannot  at  present  be  considered  as  useful  as  some 
other  notions  of  simplification/  especially  canonical  forms, 
within  the  context  of  algebraic  manipulation  systems.  We  are 
hopeful  however,  that  research  in  this  direction  will  produce 
useful  information  for  algebraic  manipulation  system  designs,  and 
have  for  this  reason  included  this  section. 


4.5.  Simplified  Radical  Polynomials 
In  this  section  we  present  two  closely  related  simplified 
forms  for  a  radical  polynomial.  Each  looks  like  a  multivariate 
polynomial,  some  of  whose  variables  are  radicals. 


Let  v  ,  k  ■  1, 
k 


(4.5-1) 


v 

k 


...  ,  N  be  a  set  of  radicals  of  the  form 
1/m 

k 

(p  ) 
k 


fief  ini tign  4.5-1.  In  form  (1),  each  m  is  an  integer  >  1  and  each 

k 

P  is  a  positive  square-free  integer  or  polynomial  with  no 
k 

(integer  or  polynomial)  factors  in  common  with  any  p  ,  j  f  k. 

j 
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In  form  (2),  the  p  are  distinct  positive  p^ime 

k 


numbers  or  positive,  primitive,  ^reducible  (over  the  integers) 
polynomials.  Form  (2)  is  a  special  case  of  form  (1). 


With  this  definition  of  (v  ], 

k 

nomial  has  the  form; 


a  simplified  radical  poly- 


(4.5-2) 


m  -1 

Q( v  )  -  a  v' 

k  Z — ,  |,k  k 

1-0 


where  each  a  is  an  Integer,  a  polynomial,  or  a  simplified 
I ,  k 


radical  polynomial  In  other  radicals  v  ,  j  <  k. 
For  example. 


1/2  1/4  1/2  1/2 

2  +3  +6  ♦  (x  x  ) 

1  2 

can  be  represented  as  a  form  (1)  simplified  radical  polynomial 
by; 

1/2  1/4  1/2  1/4  2  1/2 

2  ♦  3  +2  (3  )  ♦  (x  x  ) 

1  2 

and  as  a  form  (2)  simplified  radical  polynomial  by; 

1/2  1/4  1/2  1/4  2  1/2  1/2 

2  +  3  ♦  2  (3  )  ♦  (x  )  (x  ) 

1  2 

There  are  some  radical  polynomials  which  cannot  be 
represented  in  either  of  the  above  simplified  forms,  e.g. 
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1/2 

(-1)  .  To  allow  for  representing  such  expressions,  we  define 

forms  Cl*)  and  C  2  * ) .  In  these,  a  single  primitive  nth  root  of 

unity,  cjn,  may  be  adjoined  to  the  set  {v  With  this  addition, 

1/2  k 

(-1)  *  to4. 

We  initially  excluded  .+  1  from  the  set  fp  }  because 

k 

expressions  involving  roots  of  unity  cannot  be  handled  as 
authoritatively  (by  the  methods  wc  use)  as  other  expressions.  By 
agreeing  that  any  root  of  +1  is  1,  we  are  left  only  excluding 
roots  of  -1.  By  writing  these  roots  in  terms  of  other  expres¬ 
sions,  even  these  symbols  may  be  effectively  removed.  For 
example,  the  expression  (4. 4. 2-1)  mentioned  earlier  will  not  be 
reduced  to  zero  automatically.  On  the  other  hand,  MACSYMA  allows 

in/4  1/2  1/4 

e  or  2  (l+D/2  to  be  substituted  for  (-1)  ,  and  would  then 

simplify  the  resulting  expression  to  0. 

Since  many  expressions  can  be  represented  in  the  forms  (1) 

and  (2),  it  is  significant  that  each  of  these  is  the  basis  for  a 

canonical  form.  Furthermore,  algorithms  for 

(a)  Converting  any  radical  expression  into  a  ratio  of  radical 

pol ynomi al s 

and  (b)  converting  a  radical  polynomial'  Into  forms  (1)  or  (2) 
are  relatively  straightforward,  given  programs  for  factoring 
polynomials  and  computing  polynomial  greatest  common  divisors. 
Details  of  such  algorithms  are  contained  in  the  next  section. 
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4.6  Algorithms 

4.6.1  Removing  quotients  from  radicals 

Let  us  first  consider  the  radicands  in  a  given  radical 
expression.  A  radicand,  If  not  already  in  the  form  of  the  ratio 
of  two  polynomials  with  integer  coefficients  whose  gcd  is  1  may 
be  straightforwardly  transformed  into  such  a  form  by  rational 
simplification.  Chapter  3  describes  how  this  can  be  done  by  the 
RATSIMP  program. 

A  and  B  below  are  relatively  prime  polynomials  over  X,  with 
integer  coef f ic ients,  and  r  is  a  rational  number. 

If  r  >  1,  say  r  -  s  ♦  q,  for  Oi  q  <  1,  then 

r  s  q 

AAA 


Otherwi se, 

r  r 

A  A 

(---)  . 

B  r 

B 

4.6.2  Producing  a  ratio  of  radical  polynomials 

To  transform  the  expression  into  a  ratio  of  radical 
polynomials  requires  one  further  step.  The  expression  must  be 
expanded  over  a  common  denominator  into  the  ratio  of  two 
polynomials.  RATSIMP  can  be  used  for  this  purpose. 

4.6.3  Simplifying  radical  polynomials 

The  algorithm  below  produces  a  simplified  form  for  the 
radical  expression  by  treating  its  numerator  and  denominator  as 
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radical  polynomials.  Strictly  speaking  If  the  result  Includes 
powers  of  roots  of  unity/  we  do  not  consider  It  truly  simplified. 

STEP  Is  Make  a  list  of  all  radlcands  In  the  expression  {S 
I  -  1/  .../  N.  1 

STEP  2:  For  I  «  1,  . ..,  N,  factor  S^  Into  a  product  of  positive 

prime  numbers  and  a  polynomial  In  canonical  form  (such  as 
that  produced  by  RATSIMP).  If  the  polynomial  has  a  negative 
leading  coefficient/  add  -1  to  the  list  of  factors,  and 
multiply  the  polynomial  by  -1. 

If  form  (2)  is  required,  factor  the  polynomial  into 
irreducible  factors. 

If  form  (1)  Is  required,  factor  the  polynomial  Into 

square-free  factors  (the  SQFR  command  of  Chapter  3).  This 

uses  the  operations  of  differentiation  and  polynomial  GCD 

only,  and  Is  much  faster  than  full  factorization. 

STEP  3:  Now  for  each  g  -  gcd(S  ,S  )  I  >  k  such  that 

I  /  k  Ik 

g  i*  i,  "factor”  both  S  and  S  into  g  and  another 
‘'k  I  k  i,k 

factor  which  is  computed  by  polynomial  division. 

STEP  4;  Reduce  powers  of  a  common  base  to  powers  of  the  base  to 
the  lowest  common  degree  of  the  radical  powers.  That  is.  If 
2/12  1/2 

2  and  2  are  the  only  occurrences  of  the  base  2, 

1/6  1/6  3 

and  (2  )  respectively.  For  some 


replace  them  by  2 
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expressions/  one  of  the  bases  will  be  "tv "  in  order  to 

n 

represent  roots  of  -1  as  powers  of  to. 

STEP  5:  Rationally  simplify  the  expression,  considering  each 

distinct  radical  as  a  "variable."  Thus 

1/6  3  1/6  1/6  i/6  2 

)  ♦  (2  ))/(2  }  would  be  simplified  to  ( 2  )  ♦  1. 

STEP  6:  Simplify  the  resulting  expression  by  the  transformation 

be  be  1/6  7 

'a  '  a  ^or  be  J>  1.  Thus  (2  )  would  be  simplified 

1/7 

to  2  2 

STEP  7;  If  step  6  has  caused  any  changes,  go  to  step  5. 

be  be 

STEP  8:  Simplify  (a  )  to  a  ,  be  a  proper  fraction  in  lowest 

1/6  3  1/2 

terms:  e.  g.  (2  )  is  simplified  to  2 

STEP  9:  (optional)  Rationalize  the  denominator,  (see  section 
4.6.4  below) 

STEP  10:  (optional)  Combine  products  of  radicals.  E.  g. 

1/3  2/3  1/3 

(2  3  )  — >  (18) 

This  algorithm  terminates  for  any  finite  radical 
expression.  Clearly  all  the  individual  steps  can  be  done  in  a 
finite  time.  The  only  loop  depends  on  step  6  causing  a  change  in 
the  expression.  If  we  consider  a  recursive  polynomial  repre- 
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sentation  of  either  the  numerator  or  denominator  of  the 

expression/  step  6  can  be  applied  once  and  only  once  for  each  of 

the  different  v  in  the  polynomial.  Thus  the  loop  of  steps  5, 
k 

6/  and  7  will  also  terminate. 


4.6.4  Rationalizing  denominators 

A  radical  polynomial#  Q(v  )#  is  a  root  of  a  polynomial  S(z) 

k 

with  rational  (in  fact#  integral)  coefficients.  I f  to  is  a 

m 

primitive  (m  )th  root  of  unity#  the  other  (m  -  1)  conjugate  roots 
k  k 

of  S  are 

m  -1 
k 

Q  (&  v  )#...#  QC((o  )  v  ) . 
m  k  m  k 

k  k 

The  product  of  these  other  m  -  1  roots#  R  #  is  a  rationalizing 

k  k 

multiplier  in  that  Q(v  )  R  is  free  of  the  radical  v  .  That  is# 

k  k  k 

radicals  involving  roots  of  p  have  been  eliminated.  This  proce- 

k 

dure  can  be  repeated  for  each  of  the  v  #  k  =  N#  ...#  1#  and  all 

k 

of  the  radicals  can  be  eliminated. 

Multiplying  the  numerator  by 
N 

R  =  TI  R 
i«l  i 

completes  the  procedure  in  theory#  but  an  additional  task 
remains.  A  simple  example  illustrates  the  problem.  Let  us 
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rationalize  the  denominator  of 

2/3  1/3 

(4. 6. 4-2)  1/Q  “  l/(x  ♦  x  ). 

Proceeding  as  above/  we  see  that  there  is  just  one  radical/ 

1/3  1/3 

namely  x  in  the  expression.  Thus  k  »  1/  and  v  a  x  .  Then 

1 

the  denominator/  Q/  in  terms  of  v  has  the  form 

1 

2  2  2 
Q(v  )  ■  (v  )  ♦  v  .  The  conjugate  roots/  cj  v  and  w  v  /  when 

111  3131 

substituted  in  Q/  produce  two  other  polynomials.  R,  the  product 


of  these  three  polynomials  (noting  the  simplification  (^  )  *  1) 

3 

becomes : 

1/3  2  2/3 

(4. 6. 4-3)  x  x+(W+uj)x  +  x 

3  3 

This  is  not  satisfactory  because  the  fact  that 


2 

(4. 6. 4-4)  to  ■  -W  -  1 
3  3 

must  be  used  to  effect  a  further  simplification.  Equation 

(4. 6. 4-4)  can  be  deduced  from  the  cyclotomic  polynomial  (see  (47) 

p.  113): 

2 

(4. 6. 4-5)  4>  (x)  =  x  ♦  x  ♦  1 

3 

whose  roots  are  cube  roots  of  unity. 

We  can  generate  any  cyclotomic  polynomial  by  a  procedure 
described  in  (47)  as  follows.  Define  the  "MoMus  funct  ion"  ^(u) 


by 
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Mi n)  o 


Then 


fl  if  n  =  1 

0  if  for  some  integer  p,  ^|n 

(-1)  if  n  a  p  ...  p  ( | .e.  n 
1  L 


d>  (x)  =  IT  (xd 

h  d  |  h 


4<h/d) 

1) 


is  square-free) 


VX)  'S  easMy  ca,culated  in  MACS YMA.  Now  with  the  added  step 


of  simplifying  roots  of  unity  in  R  according  to  the  cyclotomic 
equation,  and  resimplifying  the  result,  the  procedure  is  almost 
complete.  For  even  roots,  there  is  an  additional  step  of 

eliminating  symr  *ric  values,  (e.g.  uJ  •  -to)  Although  this  does 

8  8 

not  guarantee  a  canonical  form  for  the  numerator  because  of  such 

relations  as  (4. 4. 2-1),  it  will  result  in  the  simplification  of 

the  denominator.  In  our  example,  we  find  that 

1/3  2/3  2 

R*x  x-x+x  and  R  Q  ■  x  ♦  x. 

Thus 


.  1/3  2/3 

1  x  X  -  X  ♦  X 

2/3  1/3  2 

x  ♦  x  x  ♦  x 


4.7  The  Properties  of  trie  Simplified  Form 
4.7.1  Overview  and  History 

Our  object  is  to  prove  that  any  radical  polynomial  which 
nay  be  written  in  form  (2)  is  in  canonical  form.  Lei  Z  be  the 
ring  of  integers.  Basically  we  wish  to  show  that  sufficient 
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I rrcducl bl 1 1 ty  criteria  may  be  obtained  to  show  that  each  v  Is 

k 

non-tr I  vial  algebraic  over  2[X](v  ,...,v  ),  and  that  we  have  a 

1  k-1 

regular  form  for  radical  polynomials.  It  is  clear  that  roots  of 
-1  will  have  to  be  dealt  with  In  another  manner.  With  suitable 
restrictions  to  the  class  of  representatives  of  the  extensions, 
the  regular  form  can  be  strengthened  to  a  canonical  form. 

Cavlness  In  (5)  stated  and  proved  several  I rreduclbi 1 1 ty 
criteria  which,  for  his  approach  to  the  problem,  seemed  quite 
reasonable.  That  they  are  not  strong  enough  Is  primarily  due  to 
his  interpretation  of  radicals,  which  caused  him  to  first  extend 
the  base  field  by  a  primitive  nth  root  of  unity-  This  Is  not 
necessary  In  many  instances,  and  much  stronger  results  can  be 
obtained  without  this  first  extension. 

4.7.2  The  theorem 
Theorem  4. 7.2-1 

Let  n  >  1  be  any  Integer,  X  ■  jx  ,  ..„  x  }  be  a  set  of 

1  N 

Indeterminates,  and  p  ,  ...,  p  be  a  set  of  (distinct)  positive 

1  k 

prime  numbers  or  primitive  polynomials  over  X,  irreducible  over 

,  ,  1/n 

the  integers.  Let  denote  the  rational s,  and  p  denote  the 

I 

positive  (or  any  fixed)  nth  root  of  p  .  Then  the  field 

1/n  1/n  1 

jQ( X ) ( p  ,  ...,  p  )  is  of  degree  n  over  Q(X). 

Ik 
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Equi valentl y, 

elements 

m  /n 
1 


Let  E 

k 


m  /n 
k 


{e  )  denote  the  set  of  n 
1 


0  1  m  <  n 
h 


P  *  ...  *  p 

1  k  lih  ik 

Then  the  set  E  Is  linearly  Independent  over  £(X). 
k 

The  p  which  are  polynomials  may  be  square-free  and 


relatively  prime  in  pairs  instead  of  Irreducible  without  altering 

the  result.  Also,  n  need  not  be  fixed  over  the  p  :  then  the  n 

i 


of  the  theorem  can  be  the  least  common  multiple  of  any  number  of 
distinct  n's.  (Thus  If  fifth  and  cube  roots  appear,  we  deal  v/ith 
fifteenth  roots.)  Note  that  roots  of  unity  are  uat  included  Jn 
this  theorem;  1  is  not  a  prime  number. 

Proof 

STEP  1:  Reduction  to  the  case  n  is  a  power  of  a  prime  number. 

1/n  1/n 

Let  F  ■  jQt( X ) ( p  ,  ...,  p  )  for  n  -  1,  ...  .  Clearly 

1  k 

k 

F  o  F  and  F  .  Hence  if  F  Is  of  degree  n  and  F  is  of  degree  m 
mn  n  m  p  m 

k  k 

the  degree  of  F  is  divisible  by  (mn)  ,  hence  equal  to  (mn) 
mn 

since  it  is  obviously  not  greater.  Thus  it  suffices  to  consider 
n  a  power  of  a  prime  number. 


Let  F  -  fl(X),  F 

n,0  n,l 


1/n 

F  (p  ),  etc.,  so  that  F  of  the 

n,0  n 
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previous  paragraph  Is  F  .We  must  show  that  the  degree  of  the 

ri/  k 

extension  F  /F  is  n  for  l  ■  1,  . ..,  k;  or  In  other  words, 

n,  i  n,i-l 

n 

we  must  show  that  the  equation  y  -  p  Is  irreducible  in  F  . 

I  n,l-l 


STEP  2:  Proof  for  n  the  power  of  a  prime  q. 

There  are  two  cases,  q  ■  2  and  q  an  odd  prime.  We  can 

summarize  some  useful  theorems  ((46)  pp.  291*293,  Satz  7,8)  as 
fol lows: 

Let  q  be  a  prime  number,  -0  a  natural  number,  and  <*■  a  member 
of  a  field  F. 

Except  for  q  •  2  and  >  1, 


y  -  «  is  reducible  over  F  iff  04  where  fi  Is  in  F. 
For  q  ■  2  and  "0  >  1, 


y  -<*  is  reducible  over  F  Iff  oc  ■  ^  or  ■  -4y5 
where  fi  is  in  F. 


Note  that  within  our  context,  A  is  either  a  positive  prime 

number  (not  1)  or  an  irreducible  (or  square~free)  primitive 

polynomial  over  X.  V/hat  will  be  shown  here  is  that  the  degree  of 

F  /F  -  n. 

n, I  n,l-l 

n 

Consider  the  case  1*1.  For  y  -  p  to  be  reducible,  p 

1  1 

n°  4 

must  b z  p  ,  P  or  -4^  fcr  j3  in  F  »  JQ(X).  Any  of  these  possi- 

n,  0 

bilities  would  imply  that  the  polynomial  p  has  multiple  roots  in 

1 
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Fn,o‘  S'nC*  Pi  h3S  rMl  lnte*er  coefficients,  complex  roots  of 
n 

y  "  0  would  occur  only  In  conjugate  pairs,  and  real 

(integer)  roots  cannot  occur  since  p  is  irreducible  over  the 

1 

integers,  (if  p^  is  only  square-free,  any  real  roots  could  not 

be  multiple.)  Thus  the  polynomial  y°  -  p  is  irreducible  over 

F  .  1 

n,0 

The  rest  of  the  proof  is  by  induction  on  i,  the  index  of  p, 
for  i  -  2,  ...,  k. 

Assume  that  previous  polynomial  extensions  have  been 
successfully  adjoined.  We  must  show  the  Impossibility  of 
q  2  4 

p  *  /  ,  or  -4/ 

I  ♦  1 

where  is  in  F  ,1-2 . k-1. 

n,  1 

in  fact,  we  will  even  allow  £  to  be  In  G  -  F  (^  )  without 

n,  I  n, i  n 

changln*  the  results.  Clearly,  If  a  polynomial  Is  Irreducible  In 

G,  It  Is  irreducible  In  the  corresponding  F.  The  proofs  for 
4 

and  -4 p  are  essentially  similar  to  the  proof  below  for  ft*. 

Let  us  assume  the  falsity  of  the  theorem.  Then  p  -  £ 

i*l 

where  is  in  G  .  Then  we  can  express  R>  in  this  form: 
n,  i  r 
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(4. 7.2-1)  fi 


I 

n 

c  e 
•  • 

j-1 


c  €  G 
j  n,  0 


If  all  but  one  of  the  c  is  zero,  p  -  c  Q.  e  **  which 

j  1*1  j  j 

contradicts  the  hypothesis  that  p  is  irreducible  over  the 

1*1 

integers.  It  even  contradicts  the  weak er  hypothesis  that  p  I 

I  *\ 

square-free  and  relatively  prime  to  each  of  the  other  poly¬ 
nomials.  The  contradiction  Is  obtained  as  follows:  e  **  must  be 


J 

rational,  since  it  Is  the  ratio  of  p  and  c  \  both  rational. 

!  a.  1  • 


3ut  then  p  has  factors  c  *  and  e  Q.  Even  if  c  Is  1,  p  and 
q  j  J  j  1*1 

ej  must  have  a  non-trivial  GCD,  since  their  ratio  is  then  1. 

Finally,  the  square-free  condition  on  p  eliminates  the 

i*l 

q 

possibility  that  e  Itself  is  (somehow)  rational. 

j 

Assume  there  are  at  least  two  terms,  c  ,  c  j*  0.  By  the 

j  h 


induction  hypothesis,  the  Galois  group  of  G  :G  is 

n,  I  n,  i  +  1 

transitive,  so  there  is  an  automorphism  ^  of  G  over  G 

n,  i  n,  i  +1 

where  Y(e  )/Y(e  )  t  e  /e  .  For  some  set  of  integers  f  r  ),  this 

J  h  j  h  m 
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automorphism  maps  e  into 

m 


,  rm 

e  .  Since 
m 


P  &  F  ,  the  q  qth  roots 
n ,  0 


of  ^  all  look  l!keu>V#  and  any  automorphism  wl  1 1  just  inter¬ 
change  the  roots.  Applying  ¥  to  equation  (4. 7. 2-1)  yields 
r  r 

(4. 7. 2-2)  °p  -  Y  c  0)  J  e 

7  1  •  • 

J  j 

Since  ^  Is  in  G  ,  equations  (4. 7.2-1, 2)  contradict  the  assumed 
n,  I 


linear  independence  of  the  e  over  G  . 

,  _  k  n,  I 

for  F 

n,  I 

2 

Since  the  argument  for  p  and  -4 ^5 
concludes  the  proof. 


The  same  result  holds 


is  quite  simi lar,  this 


4.8  Canonical  Forms 

Recall  that  Caviness  does  not  provide  a  canonical  form  for 
radical  polynomials.  Two  equivalent  expressions  may  be  non¬ 
identical  because  each  is  expressed  In  a  different  algebraic 
extension  of  fl(x).  In  our  simplified  form  (2)  we  are  dealing  with 
the  same  field  representatives  so  that  we  are  always  dealing  with 
the  same  extensions  for  those  expressions  which  can  be  put  into 
form  (2).  Thus,  except  for  minor  points  dealing  with  such 
questions  as  the  ordering  of  terms  In  a  sum  or  product,  form  (2) 
is  a  canonical  form  for  that  subset  of  P  which  can  be  represented 
in  form  (2),  With  suitable  recombinations  of  square-free 
factors,  form  (1)  can  also  be  made  canonical. 
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One  of  the  consequences  of  this  canonical  form  is  that  we 

can  define  a  content  and  primitive  part  for  a  simplified  radical 

polynomial.  Let  f  -  ZCX),  where  Z  Is  the  ring  of  Integers.  Then 

E,  a  simplified  member  of  P  is  In  F[v  ,  ...,  v  ].  The  content  c 

1  k 

of  E  is  a  polynomial  In  G  such  that  E  -  c-P  where  P,  the 

primitive  part,  is  a  member  of  G  whose  coefficients  in  G  are 

relatively  prime  polynomials  (or  integers).  This  c  Is  unique  up 

to  a  unit  multiple,  (see,  e.g.  (27),  p.  366) 

Now  we  may  draw  a  few  conclusions  about  the  canonical 

properties  of  the  radical  expressions  with  rationalized 

denominators.  Basically,  we  wish  to  show  that  the  denominator 

Is  unique,  regardless  of  the  order  In  which  the  v  are  removed. 

I 

in  fact,  the  value  of  the  denominator  is  not  unique,  but  it  can 
be  made  unique. 

With  the  use  of  this  concept  of  content,  we  can  transform  a 
radical  expression  with  a  rational  denominator  into  a  reduced 
radical  expression;  one  such  that  the  content  of  the  numerator 
has  no  factor  In  common  with  the  denominator.  Any  such  expres¬ 
sion  can  then  be  written  as  c*N/D  where  c  and  D  are  relatively 

prime  polynomials,  and  N  is  a  radical  polynomial  In  simplified 
form. 

Theorem  4.8-1 

If  P  and  p'  are  reduced  radical  expressions  with  numerators 
in  form  (2)  such  that  P'  ■  p ,  then  P  <  P'. 
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Proof 

Let  P  z  c*N/D  and  P'  ■,  c’-N'/D'  as  above.  Cross  multiply 
to  get  c«N*D'  ■.  c'-N'-D.  The  primitive  part  of  two  equivalent 
radical  polynomials  must  be  the  same/  so  N  *  N*.  But  then  c/D 
and  c'/D'/  both  rational  expressions/  are  clearly  Identical  also. 

4.9.  Additional  Radical  Proposals 

An  extension  to  variable  exponents  has  been  made  in  the 
simplification  algorithm  described  in  section  4.6.  In  order  to 

x 

make  more  apparent  the  relationships  between/  for  example/  e  and 
x/2 

e  /  non-numeric  exponents  of  the  base  e  are  collected  and 
examined  for  common  factors.  Bases  different  from  e  are  con- 
x  x  log(y) 

verteds  y  -->  e  ;  the  same  convention  of  positivity 

defined  in  section  4.3  is  used  to  choose  a  unique  branch  for  the 
log.  If  common  factors  are  discovered/  they  result  in  re-repre- 

x/2  x 

sentation  of  the  elements.  Thus  in  the  presence  of  e  ,  e  is 

x/2  2 

represented  by  (e  )  .  In  the  terminology  of  section  4.2/  we 

wish  to  establish  a  regular  field  description  for  the  expression 

in  terms  of  monomials.  One  consequence  of  this  would  be  that 

x  x/2  x/2 

e  ♦  2  e  +1  simplifies  to  e  ♦  1. 

Since  the  conditions  requiring  this  form  of  processing  are 

disjoint  from  those  of  the  radical  expressions  (i.e.  variable 
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exponents  are  not  allowed  In  ),  this  aspect  of  simplification 
does  not  Interfere  with  the  algorithm  of  section  4.6:  This  part 

of  the  extended  algorithm  is  completely  bypassed  unless  there  Is 
a  variable  exponent. 

To  outline  some  details  of  the  algorithm,  let  p  and  q  be 

polynomials.  Then  p*  Is  written  as  e*  l0fi(p)#  then  as 

(q  ♦....♦q)log(p...p) 

1  n  l  n 

e 

q  log(p  ) 

|  • 

and  then  as  a  product  of  terms  of  the  form  e  J 

If  q  Is  a  rational  (rather  than  polynomial)  expression.  It 
can  be  written  in  a  partial  fraction  expansion  form  (15)  as  a 

q  log(p) 

sum.  Some  of  these  terms  e  wl 1 1  be  recognizable  as  mul- 

...  *  .  Q/k  log(p) 

P  es  o  others,  say  e  .  They  are  rewritten  to  reflect 

this  fact,  and  the  expression  is  rationally  simplified  with 

respect  to  this  new  set  of  variables.  Further  efforts  in  this 

direction  are  described  in  (15).  Exponentials  are  of  particular 

Interest  because  when  extended  to  complex  arguments,  they  can  be 

used  to  express  sines,  cosines,  etc. 

4.10.  Conclusions 

A  simplification  procedure  for  radical  expressions  which 
involves  only  gcd  calculations  has  been  presented.  While 
Caviness'  version  of  a  similar  algorithm  "is 


so  encumbered  by 
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combinatorial  difficulties  that  It  cannot  be  considered  a 

practical  routine/'  (p.  72  (5))  the  algorithm  presented  here  Is 
east  I y  appl led. 

This  result  is  an  Improvement  of  Cavlness*  results  In 
several  ways,  although  It  is  not  as  general  as  his  approach. 

1.  Factoring  over  arbitrary  algebraic  extensions  of  the 
rational  field  Is  unnecessary;  in  fact,  only  greatest  common 

divisor  calculations  (of  polynomials  with  integer  coefficients) 
is  required. 

2.  It  demonstrates  that  the  only  cases  requiring  Cavlness' 
approach  are  expressions  which  contain  roots  of  -1. 

3.  The  Inessential  restriction  to  one  variable  is  deleted. 
In  terms  of  theoretical  advances,  the  PRI  formulation  of 

the  interpretation  of  radicals  allows  a  new  and  more  useful 

theorem  (4. 7.2-1)  concerned  with  the  canonical  properties  of  this 
simplified  form  to  be  proved. 

Extensions  of  the  algorithm  of  section  4.6  are  discussed  In 
terms  of  their  usefulness  and  practicality:  several  seem  useful 
enough  to  suggest  that  they  be  made  available  as  programs  In 
MACSYMA  or  similar  systems.  A  description  of  several  uses  of  just 
such  a  simplification  capability  have  been  described  In  Chapter 
3.  The  RADCAN  routine  In  MACSYMA  uses  the  easily  computed  form 
1'  of  section  4.5,  but  does  not  ordinarily  use  the  denominator 
rationalization  technique.  RADCAN  also  incorporates  the  addi¬ 
tional  Ideas  In  section  4.9,  except  for  partial  fraction  decom¬ 
position  of  exponents. 
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One  of  the  Important  consequences  of  this  simplification 
approach  Is  that  new  Integration  methods  developed  by  Rtsch  (40), 
(41)  which  rely  on  the  regularity  (In  the  sense  of  4.2)  of 
integrands,  can  be  implemented  with  relative  ease. 
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Chapter  5  -  Summary  and  Prospects  for  the  Future 

5.1  Summary 

In  Chapter  2,  The  User-level  Semantic  Matching  Facility  in 
MACSYMA,  we  have  described  a  project  both  in  man-machine  commu¬ 
nication/  and  In  analyzing  the  concept  of  an  occurrence  of  an 
instance  of  an  algebraic  pattern.  While  this  pattern  recognition 
capability  has  great  appeal  in  the  Initial  approach  to  solving  a 
new  problem/  it  often  yields  to  more  efficient  methods  when  the 
problem  is  better  formulated.  Nevertheless/  we  feel  It  is  an 
important  tool  to  place  in  the  hands  of  a  user  of  an  algebraic 
manipulation  system:  it  gives  a  handle  on  problems  difficult  to 
formulate  in  other  modes  and  allows  a  user  to  mold  a  system  tox 
conform  to  his  set  of  expressions,  and  operations.  We  tried  to 
clarify  what  we  feel  to  be  important  in  terms  of  semantic  pattern 
recognition;  merely  syntactic  patterns  leave  much  to  be  desired 
in  terms  of  ease  of  use  and  flexibility.  We  have  demonstrated 
several  types  of  applications  of  pattern  matching  extensions  to 
MACSYMA:  writing  programs,  writing  Markov-al gor I thm  rule-sets, 

and  modifying  the  simplifer. 

In  Chapter  3,  the  rational  function  representation  of 
MACSYMA  serves  as  a  vehicle  for  several  algorithms  which,  we 
believe,  correspond  to  notions  often  used  informally  and 
imprecisely  in  referring  to  operations  on  mathematical  expres¬ 
sions.  For  example,  the  notion  of  coefficient,  as  embodied  in 
RATCOEF  allows  a  mathematician  to  "collect  terms"  in  a  single 
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command*  This  would  be  quite  difficult  to  phrase  In  terms  of 
traditional  programming  languages,  or  for  that  matter,  formal 
mathematical  definitions.  This  imprecision  is  even  more  apparent 
in  the  problem  of  substitution,  where  the  most  obvious  aspect  of 
the  problem  is  Its  Inherent  ambiguity.  By  defining  the  RATSUBST 
algorithms  for  substitution,  while  drawing  on  the  power  of  the 
rational  function  programs  to  perform  global  transformations  on 
an  expression,  several  varieties  of  unamb I guous  substitutions  may 
be  performed.  Another  use  of  the  rational  function  repre¬ 
sentation,  in  the  SOLVE  programs,  has  had  consequences  in  many 
other  parts  of  MACSYMA.  Being  able  to  solve  for  roots  of  poly¬ 
nomials  (by  factoring,  or  through  radicals)  while  at  the  same 
time  solving  for  transcendental  roots  (by  inverting  functions), 
has  saved  other  programs  considerable  efforts.  SOLVE  forms  an 
effective  tool  for  finding  poles  and  singularities,  an  important 
task  for  limit  calculations,  definite  integration  (using  contour 
integration  and  residue  calculation)  and  series  expansions. 

In  chapter  4  we  have  described  new  simplification 
algorithms  for  radical  expressions,  one  of  which  uses  only  poly¬ 
nomial  greatest-common-divisor  operations  for  its  effectiveness. 
The  major  consequence  is  that  we  can  produce  a  canonical  form 
over  the  class  of  radical  expressions  not  requiring  roots  of  -1. 
The  Importance  of  the  simplification  algorithm  lies  in  two  areas: 
in  general,  simplification  to  a  canonical  form  is  valuable, 
especially  if  it  is  inexpensive;  secondly,  the  Rlsch  integration 
algorithm  depends  strongly  on  such  results. 
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5.2  Prospects  for  the  future 

The  research  described  in  this  thesis,  and  similar  work  is 
directed  toward  the  goal  of  making  man  more  creative  and 
computers  more  useful.  No  doubt  many  areas  of  computer  science 
will  benefit  from  a  better  understanding  of  how  knowledge  can  be 
incorporated  into  a  programming  system  for  mathematics. 

The  goal  of  a  computer  system  for  the  future  which  1)  Is  a 
drudge-work  slave  capable  of  doing  massive  calculations/ 
correctly  and  rapidly/  2)  Isa  mathematical  co-worker/  capable 
of  conversing  in  natural  notations  about  many  problems  from  a 
wide  range  of  disciplines;  and  3)  I s  an  encyclopedic  mentor# 
being  an  organized  collection  of  pertinent  facts  and  algorithms 
drawn  from  all  of  mathematical  analysis,  is  a  tempting  challenge. 

Reference  1,  the  proceedings  of  SYMSAM/2,  is  a  detailed 
description  of  the  present  state  of  the  art  In  algebraic  manipu¬ 
lation.  Of  the  papers  In  that  volume,  11  are  concerned  with 
applications  of  computers  to  group  theory,  a  branch  of  symbol 
manipulation  which  is  of  limited  interest  to  us  here.  Some  7 
papers  deal  with  applications  of  computers  to  problems  of 
interest  to  workers  in  celestial  mechanics,  relativity,  high- 
energy  physics,  and  other  areas  where  massive  problems  in  algebra 
occur.  Some  7  papers  discuss  the  present  state  of  new  or  revised 
computer  systems  for  algebraic  manipulation.  About  a  dozen  are 
concerned  with  basically  peripheral  issues:  specifying  the  syntax 


149  *• 


for  a  language  for  communicating  v/i  th  an  on-line  system  for 
algebraic  manipulation,  hand-written  Input,  transportabi 1 i ty  of 
such  systems,  etc.  A  significant  number  of  papers  (8)  describe 
developments  which  may  actually  benefit  all,  or  most,  algebraic 
manipulation  systems  of  the  future,  in  that  they  introduce  new 
algorithms  for  standard  operations  of  mathematics.  Often  these 
algorithms  represent  improvements,  in  terms  of  computation  time, 
of  many  orders  of  magnitude.  Furthermore,  the  theoretical  tools 
used  for  analyzing  the  times  taken  for  execution  of  algorithms 
have  been  considerably  refined.  A  major  factor  in  the  success  of 
these  algorithms  has  been  the  use  of  modular  arithmetic.  Four 
papers  on  simplification,  and  three  on  limits  and  integration 
completed  the  program. 

Many  of  these  techniques  and  manipulatory  algorithms 
described  at  SYMSAM/2  are  present,  or  are  being  implemented  in 
MACSYMA,  and  for  that  matter,  in  several  other  current  algebraic 
manipulation  systems.  By  joining  together  as  many  of  these 
advances  as  possible,  in  forms  which  are  easier  to  approach  than 
the  original  implementations,  we  hope  to  produce  a  system  which 
is  useful  both  in  applications,  and  in  exploring  the  problems  of 
algebraic  manipulation  in  general. 

During  the  development  of  MACSYMA,  a  number  of  applications 
came  to  our  attention  through  colleagues  at  M.l.T.  and  Harvard. 

As  a  rule,  we  have  avoided  the  so-called  "naive  user,"  but  have 
attempted  to  attack  problems  with  a  combination  team  of  a  pro- 
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grammer  familiar  with  MACSYMA  (the  author)  and  an  application 
specialist*  This  approach  has  sufficed  to  explore  several 
problems.  With  Prof.  Eytan  Barouch  of  M.i.T's  Department  of 
Applied  Mathematics/  several  problems  arising  in  statistical 
mechanics  were  attempted/  with  only  moderate  success:  each 
problem  was  reduced  to  a  solvable  one  whose  answer  was#  in 
retrospect/  not  sufficiently  accurate  to  be  interesting;  the  full 
problem  was  (at  least  without  new  insights)  demonstrated  to  be 
beyond  present  machine  capacity.  With  Mr.  Francis  Heile/  an 
M.l.T  physics  graduate  student/  a  masters'  thesis  (25) 
calculation  was  completed;  the  hand  calculation  of  the  same 
result  had  been  abandoned  because  of  Its  complexity.  This 
involved  the  computation  of  traces/  utilizing  symmetry  properties 
for  simplification. 

With  Mr.  Henry  Mok#  another  M.l.T.  graduate  student/  the 
arithmetic  statements  In  a  large  computational  program  for  use  in 
plasma  physics  research/  were  checked  (and  an  error  found  in  the 
previous  hand  calculations). 

Other  members  of  the  programming  group  have  tried  MACSYMA 
on  problems  that  have  come  to  their  attention;  that  few  produce 
directly  publishable  results  is  not  surprising  —  the  ratio  of 
good  ideas  to  bad  in  mathematics  is  probably  quite  small/ 
although  the  bad  ideas  often  require  just  as  mucn  investigation 
as  the  good.  It  appears  that  systems  such  as  MACSYMA  can  perform 
a  useful  service  if  they  only  weed  out  untenable  computational 
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approaches  more  rapidly  than  hand  computations.  They  are  only 
occasionally  called  upon  to  solve  problems  to  completion  by  long 
and  complex  computation. 

Where  can  we  go  from  here?  In  the  first  place,  the  study 
of  particular  applications  is  a  necessity.  In  this  way,  further 
developments  of  systems  like  MACSYMA  can  proceed  In  the  most 
useful  direction,  rather  than  towards  the  unnecessarily  overgrown 
syntactical  and  semantic  prospects  that  seem  to  threaten 
developing  algebraic  manipulation  systems. 

Of  course,  since  applications  are  presumably  the 
justification  for  all  this  work,  efforts  In  making  such  systems 
more  readily  available  for  applications  may  be  of  major 
Importance.  These  efforts  should  Include  hardware  development 
(especially  wel 1 -desi gned  terminals,  cheaper,  faster,  list- 
processing  computers,  larger  memories),  and  system  development 
(especially  systems  for  management  of  large  programs  and  data¬ 
bases,  the  sharing  of  resources,  and  the  facilitation  of  man- 
machine  interaction). 

There  is  clearly  more  work  to  be  done  In  understanding  the 
methods  of  mathematical  analysis,  and  studying  alternative 
algorithms  for  common  tasks.  Large  strides  in  this  direction 
have  already  been  taken,  but  more  is  needed  in  understanding  such 
"simple"  questions  as  "What  is  the  fastest  method  for  raising  a 
polynomial  to  an  integer  power?"  (recent  work  by  L.  Helndel,  (22) 
and  separately,  W.  M.  Gentleman  (17)  draw  some  conclusions;  my 
own  work  In  this  area  may  be  found  in  (13)) 
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In  terms  of  making  a  more  versatile  mathematical  system,  we 
might  explore  the  construction  of  a  deductive  mathematical  system 
with  a  more  varied  data  base.  We  should  be  able  to  use  such 
facts  as  "the  second  derivative  of  F  with  respect  to  X  i s  0"  or 
"the  absolute  value  of  Y  is  less  than  or  equal  to  5."  No  current 
system  goes  very  far  toward  the  difficult  task  of  welding 
mathematical  facts  into  a  coherent  database. 

We  should  seek  to  extend  the  present  boundaries  of  the 
functions  which  we  can  handle  through  rigorous  methods.  Zero* 
equivalence  tests  for  handling  large  classes  of  functions  appear 
to  be  possible.  Stronger  results  concerned  with  nested  radicals 
may  also  be  possible. 

Better  algorithms  for  factoring  polynomials,  computing 
solutions  to  sets  of  linear  equations,  may  also  be  valuable 
goals,  but  it  may  be  that  heuristic  approaches  will  begin  to 
dominate  parts  of  the  currently  algorithmic  problem  domain.  By 
rearranging  the  ordering  of  variables,  enormous  speed  factors  can 
be  gained  in,  for  example,  computing  polynomial  greatest  common 
divisors.  Certainly  a  human  mathematician  would  explore  these 
other  methods  before  embarking  on  a  lengthy  calculation  --  why 
not  a  computer? 

Just  how  realistic  these  goals  are,  remains  to  be  seen. 

The  research  of  the  last  decade  has  revealed  both  unexpected 
successes,  and  surprisingly  basic  difficulties.  We  expect  that, 
as  in  the  past,  research  in  the  field  of  algebraic  manipulation 


will  continue  to  have  Important  implications  In  our  understanding 
of  mathematical  algor Ithms,  the  nature  of  computation  and 
artificial  intelligence/  programming  languages  and  systems,  and 
problems  of  man~machine  communication. 


154 


Appendix  I 

MACSYMA  Users'  Manual 

sri:  & 10  S' v— 'sps.'s' 

Ihl  mi  ,  d,?p'ayed‘  $  causes  the  command  to  be  evaluated 

H «V  p- r-F-*- 

ss^si.i^rTrirSuSsj'S'K  ?id£&he  who,e  ~d- 

•'cc™:ds^1;aNribyeS%^d,pe^r!^1offX,pya«  ^ 

,nPUt  syntax  °f  MACSYHA'  ‘Hen  "ay'sMp  ahead 
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a°nfdM^|Trm|an0rne|raOPrsh|^?5wOf  wVT' 

and  Z  stand  for  any  expressions;  U  and  V  for  variables  ci' Ylf 
a rgumen t CthebQbWous ^manner <ta*<e  *"  « 


,NPUT  MEANING 


AB 

'AB 

1 

1.2 

F[X, Y] 

F(X,Y) 

F [X, Y]  ( W,  Z ) 

XI 

X**Y  or  XfY 

X/Y 

-X 

X+Y 

X-Y 

X*Y 

X-Y 

X<Y,  X>Y,  X>-Y, 

X<-Y 
X  AND  Y 
X  OR  Y 
'X 

rx,Y] 

IF  X  THEN  Y 
IF  X  THEN  Y  ELSE  W 
FOR  1:1  STEP  1 
UNTI L  I >3  DO  X 
A:  X 
V: : X 
A( V) : -X 

A( V) : Y  FOR  ALL  W 
X.Y 

A  FOR  INTEGER  3<X<10 
A  FOR  3<X< 10 
BLOCK(X,Y,Z) 

X,Y,Z 


variable 
quoted  variable 
integer 

floating  point  number 

subscripted  variable 

function  Invocation 

subscripted  function  invocation 

factorial 

exponent  I  at  ion 

quotient 

negation 

sum 

di fference 
product 

equality  predicate  or  equation 

less  than,  greater  than,  less  than  or  equal  to, 

i !^ef i e.r.ihan  ?r  equal  t0  Predicate  or  inequality 
logical  AND  or  Boolean  operator 

logical  OR  or  Boolean  operator 

quoted  expression 

list  of  express  I ons 

condi tlonal 

cond I t lonal 

DO  loop 


assign  the  value  of  X  to  A 

assign  the  value  of  X  to  the  value  of  V 

define  function  A(V) 

define  function  A(V) 

non-commutat Ive  product 

indexed  set 

real  line  segment 

program  block  <X,Y,Z  are  statements) 
EV(X,Y,Z):  X  Is  an  expression, 

Y, Z  specify  environment 
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1.1  The  Input  Stream  Editor 


At  any  point  while  he  is  Inputting  a  comand/  the  MACSYMA 
user  can  enter  the  input-stream  editor  by  typing  #.  The  editor 
is  given  the  string  of  characters  typed  so  far  in  the  current 
command.  In  the  case  of  a  detected  syntax  error,  the  entire 
previous  command  string  will  be  given  to  the  editor. 

All  the  commands  to  the  editor  reference  a  cursor  (an 


underscore  or  back-arrow,  depending  on  the  console)  which  is 
played  within  (or  at  either  end  of)  the  string  of  characters 
under  edit.  In  the  description  to  follow,  n  stands  for  a 
positive  or  negative  integer.  The  default  value  of  n  is  +1. 
is  positive,  the  commands  operate  toward  the  right  of  the 
if  n  is  negative,  they  operate  toward  the  left. 


d  i  s- 


If  n 
cursor; 


nC 

nR 

J 

nL 

Sst  r i ng# 


nD 

nK 


I  string# 
# 


moves  the  cursor  n  characters. 

moves  the  cursor  n  characters  in  the  reverse 

di rect ion(nR*-nC) . 

(also  T)  moves  the  cursor  to  head  (top)  of  string, 
moves  the  cursor  to  the  r{ght  of  the  nth  carriage  return 
(e.g.  L  moves  to  the  next  line) 

moves  the  cursor  to  the  right  of  the  first  occurrence  of 
the  string  of  characters  "string"  searching  toward  the 
right .  (-S  impl ies  left) 
deletes  n  characters. 

deletes  all  the  characters  through  the  nth  carriage 
return,  (e.  g.,  K  deletes  the  remainder  of  this  line) 
inserts  the  characters  "string" 

leaves  the  editor  and  returns  to  inputting  from  the 
user's  console. 


I .2  System  Control 

Lines  are  consecutively  numbered,  except  that  the  input 
line  Ci  will  be  following  by  an  output  line  (if  one  is  generated) 
named  Di.  The  next  input-output  pair  will  be  labelled  C(i+1)  and 
D(i+1),  respectively.  If  one  command  produces  several  lines  of 
output,  the  labels  will  begin  with  an  E,  and  the  line  number  will 
be  incremented  for  each  additional  line.  A  user  can  refer  to  any 
command  or  expression  by  its  line  label.  The  most  recently 
computed  expression  may  be  referred  to  as  "%". 

The  system  can  be  set  to  automatically  write  old  expres¬ 
sions  onto  secondary  storage.  The  process  is  controlled  by  the 
following  variables  which  can  be  set  by  the  user.  (e.g. 

F I LES I ZE s 10$  would  set  FILESIZE,  or  by  the  OPTIONS  command.) 
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variable 

default  value 

Purpose 

DSKUSE 

FALSE 

if  set  to  T,  then  expressions  are 
automatically  filed  aw~y. 

Fi LESIZE 

10 

Expressions  are  written  out  with 
FlLESiZE  expressions  In  each  file. 

RETAiNNUM 

8 

When  the  number  of  expressions  in  core 
reaches  FlLESiZE  ♦  RETAiNNUM,  a  file  is 
wr  i tten. 

FILENAME 

username 

The  first  name  of  the  file  written  out. 
The  second  names  (our  filing  system 
requires  two  names  for  a  file)  are 
1,2,.... 

iNCHAR 

C 

The  prefix  character  for  inputted  line 
numbers . 

OUTCHAR 

D 

The  prefix  character  for  outputted  line 
numbers . 

LINECHAR 

i  •< 

E 

• 

The  prefix  character  for  intermediate- 
output  line  numbers. 

•  •  .  .  .  a  •  m  «  « 

When  an  expression  is  written  out,  the  name  of  the  file 
containing  it  is  attached  to  the  expression  name  in  core.  Thus 
when  the  expression  is  referenced  in  a  later  step,  it  can  be 
automatically  retrieved  from  the  file. 


At  the  end  of  the  session,  the  seconda  y  storage  files  can 
be  deleted  by  the  command  FINiSHO.  The  command  FINISH(TRUE) 
allows  the  user  to  retain  some  or  all  of  the  expressions  jn  his 
file,  in  order  to  specify  the  form  and  contents  of  the  retained 
file,  he  must  answer  a  series  of  questions: 


ques-tion  response  meaning 


OUTPUT  DEVi CE?(f i 1 e  spec) 
EDIT?  N 

Y 

INTERNAL?  Y 

N 

Y 

N 


The  name  of  the  file  on  which  the 
output  will  be  saved. 

Save  the  files  as  they  are.  This 
response  will  cut  off  further  questions. 
Read  the  files  back  into  memory,  one  ex¬ 
pression  at  a  time,  so  that  selected  ex¬ 
pressions  can  be  saved  on  the  previously 
speci f ied  file. 

Save  the  expressions  in  machine  readable 
form,  in  this  form  they  may  be  read 
back  into  a  fresh  system  using  RESUME. 
Save  only  the  two  dimensional  display 
forms . 

(This  is  asked  for  each  expression.) 
include  the  expression  currently  dis- 
pl ayed. 

Do  not  i ncl ude  i t . 


SAVE? 
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RESUME C file  specification)  reads  a  file  previously 
outputted  through  FINISH,  displaying  the  commands  and  recomputing 
the  results. 

BATCH( file  specification)  reads  an  input  text  from  the 
designated  file,  command  by  command.  When  the  end  of  the  file  is 
reached,  further  commands  may  be  supplied  by  the  user  at  his 
console.  DEMONSTRATE  is  like  BATCH  except  it  pauses  waiting  for 
the  user  to  type  a  space  between  commands.  Any  other  character 
causes  a  return  to  the  user  console  for  further  input.  Add*- 
tional  arguments  may  be  supplied  to  cause  some  of  the  input  lines 
to  be  skipped. 


1.3  Rules  for  Expression  Evaluation 

We  have  attempted  to  define  a  philosophy  of  expression 
evaluation  in  MACSYMA  so  as  to  lead  to  the  most  natural 
mathematical  usage.  In  so  doing,  we  necessarily  become  involved 
in  a  complicated  set  of  rules  to  (in  part)  replace  explicit 
quoting  mechanisms,  such  as  that  in  LISP.  If  we  have  been 
successful,  a  mathematician  should  rarely,  if  ever,  have  to  refer 
to  the  rules  concerned  with  noun  and  verb  forms,  below. 

A:X  assigns  A  the  value  of  X.  This  is  the  way  a  user  would 
typically  assign  a  value  to  a  variable.  Values  are  also  assigned 
when  the  variables  are  used  as  labels  for  expressions  on  input 
and  output.  This  assignment  is  analogous  to  the  FORTRAN  or 

the  LISP  SETQ. 

A  variable  which  does  not  have  a  value  stands  for  itself. 
Numbers  always  stand  for  themselves. 

A  built-in  or  user-defined  function  is  either  a  noun¬ 
function  or  a  verb-function.  A  verb-function  is  one  which 
attempts  to  effect  an  applicaton  of  the  function  to  its  arguments 
and  thereby  remove  itself  from  the  expression.  For  example, 
INTEGRATE  is  a  verb-function,  and  ordinarily  will  attempt  to 
perform  an  integration.  On  the  other  hand,  SIN  is  a  noun¬ 
function,  and  will  not  attempt  to  evaluate  itself,  although  it 
will  evaluate  its  arguments.  The  EV  command  can  be  used  to 
evaluate  an  expression  in  a  context  which  says'that  (for  example) 
all  SINs  should  be  numerically  evaluated.  That  is,  selected  noun 
forms  can  be  converted  to  verb  forms.  Similarly,  if  a  normally 
verb-type  function  is  desired  to  operate  as  a  noun-type  function, 
it  may  be  so  declared  via  the  function  NOUN.  Thus  INTEGRATE, 
when  declared  a  noun,  would  normally  return  an  integral,  even  if 
the  integration  could  be  performed.  If  the  function  F  is  a  verb, 

'  F  can  be  used  as  the  noun  form  for  F.  If  F  Is  a  noun,  * ' F  (not 
"F)  can  be  used  as  the  verb  form.  If  a  verb-function  cannot  be 
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It  u  Si!  '  l  e5amp,e'  an  integral  which  cannot  be  computed, 
t  is  simply  returned  as  though  it  were  a  noun-function.  if  F  is 
already  a  noun,  *F  is  the  same  as  F. 

are  vl^b^un?^^!  tunctions  are  n°u"s-  Other  defined  functions 
f.nLftnH  *  ?  thelr  names  are  quoted.  The  arguments  of 

undefined  functions  are  evaluated,  but,  obviously,  the  function 
itself  cannot  be  evaluated,  and  so  Is  treated  as  a  noCn  As  an 
expression  is  evaluated,  it  is  also  simplified. 

If  a  name  is  subscripted  (a  subscript  is  enclosed  in  sauare 
brackets  on  input),  then  its  value  Is  stored  In  an  array.  The 

?n^mA°ci^n^2rrayAmay  be  declared  by  the  command  ARRAYSIZE 

u  An  arr?y  need  not  have  ‘‘ts  dimensions  declared, 
but  if  it  has  been  declared,  it  will  be  permitted  to  have  only 

JJTnniW  su^scripts*  At  the  first  attempt  to  store  a  value  Tn 
an  undeclared  array,  a  mechanism  will  be  set  up  to  describe  the 
entries  and  their  values  in  terms  of  a  hash-coded  list.  The  hash 
nCmerV^ibe  p?mputed  fro™  the  subscripts  whether  or  not  they  are 

the  't*h  I  30  a7ay  iS  subse<iuently  declared,  the  values  in 
the  hash  table  are  transferred  to  the  new  (true  array) 

prlsslon*  'pa‘aflhe  Yalue  °f  an  array  entry  can  be  a  number,  ex¬ 
pression,  equation  (etc.)  regardless  of  whether  it  is  a  hash 

ur?s  ?ni  t  fan  v  anay\  i  hafhed  array  ,s  organized  as  follows: 
Its  initially  allocated  a  hash  table  with  four  entries.  Each 

ntoethSttre„^ta'SS  3  "St  sub?cr,»ts  values  «b'ch  hashes 
into  that  entry.  Whenever  the  number  of  entries  with  values  is 

?  he  si«  ^sh  table,  the  site  of  the  hash  Hbll 

is  doubled.  Whenever  the  operation  is  executed,  a  check  Is 

array 'entry  !I  Si"™  '*  ti^pp^e ' 

A: : X  assigns  the  value  of  A  the  value  of  X.  The  value  of  A 
ulp  set?  Var,ab,e  ,n  thls  sl  tuatl*on-  This  is  a™ logout  to  a 

i.4  Function  Definitions  and  Arrays 

60  fnrMncfYMA  ' "co rpor a  * e s  *  programming  syntax  resembling  Algol - 
60  for  use  on  the  top  (command)  level  and  in  function  defini- 
tions.  The  parser  is  entirely  syntax  directed,  so  that 

PxacrH^!n-f-t0  *5®  8rammar  can  be  easily  included;  also,  an 
exact  definition  of  the  acceptable  forms  (and  their 

tlff?rno^aJi0n?  50  °f  USP  and  MACSYMA  functions)  can  (but 

C  !  given‘  The  syntax  is  illustrated  in  figure  1.3. 

Each  of  these  constructions  has  fairly  conventional  interpreta- 

f'ict  eX0np  ^hn-SVTb01  iC:an^  trad,tional  numeric  notions  con- 
h  ext°  pr  nh  .instance  is  in  inequalities,  and  is  discussed  in 
tne  next  section  in  more  detail. 
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Tht  first  argument  to  Mi-,(  (the  function  definition 
operator)  m&y  take  one  of  three  form*  ft  it),  fflj  of 
Let  tht  iecon<j  argument  to  (that  U*  the  right  hind  side) 

be  y.  in  the  first  case,  the  variable  f  denotes  a  function* 
with  value  1  ambdaC x)  y.  Jn  the  second  case,  *  function  definition 
Is  be ln§  associated  with  an  array.  The  name  f  ft  denoted  an 
afx PR  with  value  UmbdaCDy.  An  AEXPR  Ss  used  ns  follows,  if  a 
particular  value  of  an  undeclared  array  (it  is  an  array  if  the 
variable  is  subscripted  or  if  the  name  has  previously  been 
subscripted  and  assigned  a  value)  is  not  present  in  the 
associated  hash  table*  a  check  is  made  to  see  if  the  name  also 
denotes  an  AEXPR.  If  so*  this  function  is  evaluated  and  the 
resulting  value  is  stored  in  the  hash  table  and  also  returned. 

If  no  value  is  present  and  no  AEXPR  is  present*  the  expression  is 
handled  as  though  it  were  an  undefined  function. 

If  the  first  argument  of  is  f[i](x)*  the  third  case, 

then  f  is  denoted  an  AEXPR  as  above*  but  this  AEXPR  evaluates  to 
a  function  of  x.  For  example*  given  f [ i] ( x) : -x** i *  evaluating 
f[3](5)  would  cause  the  AEXPR  to  be  evaluated  to  1 ambda(x)x**3 
and  this  value  would  be  stored  as  the  value  of  f[3]  and  also 
applied  to  5  to  yield  5**3.  A  subsequent  evaluation  of  f[3](7) 
would  cause  the  value  1 ambda(x)x**3  of  f[3]  to  be  retrieved  and 
appl ied  to  7. 

The  second  argument  to  :=  (the  right  hand  side*  or 
procedure  body)  is  ordinarily  left  unevaluated.  This  may  be 
altered  by  the  use  of  the  double  quote  (")*  which  causes 
immediate  evaluation.  Thus  F(X):»"%@  uses  the  most  recently 
computed  expression  as  the  procedure  body. 

If  several  expressions  or  commands  are  included  in  the 
procedure  body*  and  dummy  variables  are  needed*  the  correct  form 
is*  for  example* 

F(X):«BL0CK( [Y]  *  Y : 1*  A*  IF(Y>X)  THEN  RETURN(Y)  ELSE 
0 1  SPLAY ( Y) *  Y: Y+l*  G0(A)) 

This  is  equivalent  to 

F(X):-F0R  Y : 1  STEP  1  UNTIL  Y>X  DO  D I  SPLAY ( Y ) 

1.5  Predicates  and  Conditionals 

The  comparison  operators  "<"*  and  are  not  evaluated 

in  ordinary  contexts;  that  is*  they  are  nouns.  However*  these 
operators*  along  with  AND  and  OR  are  evaluated  when  they  are  in 
the  predicate  position  of  the  IF-THEN-ELSE  construction;  that  is* 
they  are  transformed  into  verbs.  If  the  predicate  evaluates  to 
FALSE*  the  ELSE  clause  is  evaluated  and  returned.  Otherwise  the 


161 


THEN  clause  is  evaluated  and  returned. 


i.6  Special  Constants 

There  are  a  great  many  special  constants  and  functions  in 
mathematics  which,  if  given  their  common  names,  would  pre-empt 
many  of  the  letters  of  the  alphabet.  To  avoid  various  types  of 
misunderstandings,  we  have  chosen  the  symbol  as  an  "escape" 

character  for  special  symbols.  Thus  the  base  of  the  natural 
logarithms,  e,  is  typed  into  MACSYMA  (and  displayed  by  MACSYMA) 
as  %E.  Other  symbols  in  this  category  include  %Pt  (the  ratio  of 
the  circumference  to  the  diameter  of  a  circle),  and  %\ ,  the 
square  root  of  -1. 


1.7  General  Purpose  Commands 

1 NTEGRATEfexp. var)  integrates  exo  with  respect  to  var  or  returns 
an  integral  expression  if  it  cannot  perform  the  integration. 

1 NTEGRATE(exp. var.  low. high)  finds  the  definite  integral  of 
exp  with  respect  to  var  from  low  to  high.  Several  methods 
are  used,  including  direct  substitution  in  the  indefinite 
integral  and  contour  integration,  improper  integrals  may  use 
the  names  INF  for  positive  infinity  and  MiNF  for  negative 
infinity.  if  an  integral  "form"  is  desired  for  manipulation 
(for  example,  an  integral  which  cannot  be  computed  until  some 
numbers  are  substituted  for  some  parameters),  the  noun  form 
'INTEGRATE  may  be  used. 

Pi FF( exp.varl.nl. . . . .vark, nk)  differentiates  exp  with  respect  to 
var 1  n i  times.  if  k»l  and  nl»l.  nl  may  be  omitted: 

PI FF(exp, var) .  if  the  derivative  "forms"  are  required  (as, 
for  example,  when  writing  a  differential  equation),  ' P i F F 
should  be  used. 

PEPENPENCi  ES(H, . . . ,ln)  declares  functional  dependencies  used  by 
PiFF.  Each  f±  (i»l,n)  has  the  format  f (^1, . . . ,Yjn)  where  each 
v i  (j=l,m)  is  a  variable  on  which  f  depends.  Thus  DiFF(Y,X) 
is  0,  initially.  Executing  PEPENDENCi ES(Y(X) ) $  causes  future 
differentiations  of  Y  with  respect  to  X  to  be  displayed  as 

PY 

PX 


GRAPEF(f (xl . xn) .gl(xl) - ,gn(xn) )  defines  the  derivatives  of 

the  function  £  with  respect  to  i ts  n  arguments.  That  is, 
df/dxjL  =  gl(xl) ,  etc. 
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UMI T(£2^/^^/yiaJ./4jjj_t,)  finds  the  limit  of  exp  as  the  real 

variable  ^aJL  approaches  the  value  val  from  the  direction  dl r. 
J2-1-J1  may  have  the  value  PLUS  for  a  limit  from  above,  MINUS  for 
a  limit  from  below,  or  may  be  omitted  (Implying  a  two-sided 
limit  is  to  be  computed).  LIMIT  uses  the  following  special 
symbols:  INF  (positive  infinity)  andMINF  (negative 

infinity).  On  output  it  may  also  use  UNO  (undefined)  and  I ND 
(indefinite  but  bounded).  'LIMIT  may  be  used  to  simply 
create  a  limit  noun  form. 


RES  I DUE(j»2ig/.Yii£/Yal,flLr.der )  computes  the  order th  residue  in  the 
complex  plane  of  the  expression  exp  when  the  variable  va r 
assumes  the  value  va 1 . 

SUBSTI TUTE(a,Ji, c)  substitutes  a  for  In  £.  £  must  be  an  atom  or 

a  function  with  arguments,  rather  than  a  function  with  only 
some  of  its  arguments.  When  does  not  have  these  charac¬ 
teristics,  one  may  sometimes  use  SUBSTPART  or  RATSUBST. 

SUBSTI  TuTE(£al,£2U2)  or  SUBSTI  TUTE(  r.e.ql . eakl  .exp)  are 

other  permissible  forms.  The  fifll  are  equations  indicating 
substitutions  to  be  made.  For  each  equation,  the  right  side 
will  be  substituted  for  the  left  In  the  express  ion  **£  (if 
the  left  side  Is  non-atomic,  and  the  right  side  is,  the 
equation  will  be  "flipped") 


EXPAND(£iip)  will  cause  an  expansion  of  the  argument.  The  MACSYMA 
variables  MAXNEGEX  and  MAXPOSEX  (originally  set  to  6)  control 
the  maximum  negative  and  positive  exponents,  respectively, 
which  will  expand.  EX PANO (£j8Lja, expands  exp,  but 
for  MAXPOSEX  and  n  for  MAXNEGEX. 


uses  £ 


SIMPLIFY(£X£)  simplifies  Its  argument,  thus  overriding  the  value 
of  the  MACSYMA  variable  SIMP  which  if  set  to  FALSE  stops  sim- 
pl if ication. 


PARTijjjsji, Hi, . . . , xiR )  obtains  a  subexpression  of  exp  which  is 
specified  by  the  indices  ai.  The  index  nl  which  (like  all 
the  indices  is  a  non-negative  integer)  selects  the  argument 
of  the  top  level  operator  of  corresponding  to  Its  value. 

Thus  PART(Z+Y,2)  yields  Y.  The  index  n2.  (if  specified) 
picks  up  an  argument  of  the  result  of  PART(exo, nl) .  Thus 
PART(Z+2*Y, 2, 1)  yields  2.  The  operator  is  considered  to  be 
argument  0. 

In  exponentiation,  the  base  Is  considered  argument  1, 
and  the  exponent  argument  2.  In  a  quotient,  the  numerator  is 
argument  1,  and  the  denominator  is  argument  2.  A  minus  sign 
appearing  in  the  display  is  considered  as  an  operator.  For 
example 
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(Cl)  X+Y/Z**2@ 

(Dl) 

(C2)  PART(D1,1,2,2)@ 
( 02 ) 


Y 

—  ♦  X 
2 
Z 


2 


(£&fi/nl/ . .  :/ali)  selects  the  same  subexpression  as  PART,  but 
?k  J uf c  return 1 n«  that  subexpres s i on  as  its  value,  it 

d?sD^vpdh?n^?Hle  e£pressl2n  Wltl?  the  selected  subexpression 

va?Sl  nfdt-i  d?  b5X#  Tbe  variable  *PART  is  given  the 
value  of  the  selected  portion.  Thus  in  the  example  above. 


( C2 )  OPART (Dl,  1,2, 1)@ 
(D2 ) 


2 

***** 

*  Z  * 
***** 


♦  X 


SUBSTPART(A,fiAB/Ol,  .  .  .,ILk)  substitutes  A  for 

picked  out  by  the  rest  of  the  arguments, 
va 1 ue  of  exp. 


the  subexpression 
I t  returns  the  new 


K,LLcifS/*  ;i'-aL^).eUm?nates  Its  arguments  from  the  MACSYMA 

nlmp6  rhif^arf  S  !!  Yar?ab,e'  a  function  name,  or  an  array 
name,  the  designated  item  is  removed  from  core  and  the 

storage  it  occtpies  is  reclaimed,  arc i  *  "HISTORY" 
eliminates  all  input  and  output  lines  to  date  (but  not  other 
named  items).  argi  =  a  number,  n,  deletes  the  last  n  lines. 

ST0R^?^ lsf5lmMar  to  K,LL  ' n  that  it  reclaims  core 

to°STORF  ar!  *  qUit?  35  mUch><  The  values  of  the  arguments 
ST0R!  are  removed  from  core  and  saved  on  a  secondary  stor- 

readdback6th  Spe<r‘al  Indicators  left  in  core  allow  MACSYMA  to 
a  1  „  Whenever  referenced.  The  arguments  can 

"H I STor Y"  !rp  InT  "a|"?s'  0r,  array  references.  Numbers  or 
HISTO.Y  are  not  acceptable,  since  storage  of  the  input  and 


output  lines  is  automatic  and  controlled 


by  RETAINNUM. 


sfmply  backs  up  expressions  on  disk,  but 
leaves  them  in  core  as  well. 

C0EFF(£AJ2/^fl£/n)  obtains  the  coefficient  of  var»»n  in  exo  For 
best  results,  should  be  expanded.  ^  must  be  an  integer 
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or  a  rational  number.  Coefficients  of  var»»n  which  are 
functions  of  var  are  ignored.  This  command  is  less  powerful 
than  RATC0EF/  but  is  sometimes  convenient  in  interactive 
si tuations. 

( C2 )  COEFF(Y+X*%E**X+1,X,0)@ 

<02)  Y  +  1 


SUNK  exp, j nd, 1o, h i )  performs  a  summation  of  the  values  of  exp  as 
the  index  Ind  varies  from  JLq.  to  hj_.  if  the  summation  cannot 
be  performed,  or  if  'SUM  is  used,  the  value  is  a  sum  noun 
form  which  is  a  representation  of  the  sigma  notation  used  in 
mathemat i cs . 

< C 3 )  SUM( 1**2, 1 ,1,4)@ 

(D3)  30 


PRODUCT  (analogous  to  SUM  above). 

EV(&&P-/aji&l, .  ,  . ,argn)  causes  the  expression  £&£  to  be  evaluated 
and  simplified  with  switches  set  according  to  the  values  of 
the  a rg i . 

EVAL  reevaluates  the  expression  so  that  variables  in  it 
which  have  values  will  be  evaluated. 

SIMP  overrides  the  setting  of  the  SiMP  switch. 

EXPAND  causes  expansion.  EXPAND(n,xn)  set  the  values  of 
MAXPOSEX  and  MAXNEGEX . 

DIFF  causes  all  differentiations  indicated  to  be 
performed.  PI FF(varl, . . . .vark)  causes  only  differentiations 
with  respect  to  the  indicated  variables. 

NUMER  causes  SIN,  COS,  LOG,  and  "**"  with  numerical 
arguments  to  be  evaluated. 

vgexp  causes  the  substitution  of  exp  for  y. 

Any  other  function  names  (e.g.  SUM)  cause  evaluation  of 
occurrences  of  those  names  as  though  they  were  verbs. 

The  arguments  following  the  first  (exp)  may  be  given  in 
any  order.  It  should  be  understood  that  EV  performs  a  single 
evaluation  and  simplification.  Thus  all  of  the  functions  are 
performed  in  one  scan.  This  is  possible  because  the  simpli¬ 
fier  is  used  to  perform  expansions,  differentiation,  and 
numerical  evaluations  by  the  setting  of  switches.  For 
example; 


( C4)  SIN(X)  +  C0S(Y)  +  (W+1)**2+'DIFF(SIN(W),W)@ 
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D  2 

( D4 )  COS ( Y)  +  SIN(X)  +  --SIN(W)  +  (W  +  1) 

DW 

(C5)  EVU,NUMER,  EXPAND,  D I  FF,X=2,Y=1)@ 

2 

( D5 )  COS(W)  +  W  +  2  W  +  1.425324 

An  alternate  syntax  has  been  provided  for  EV,  whereby  one  may 
just  type  in  its  arguments,  without  the  EV().  That  is,  one 
may  write  simply  exp.argl. .... argn. 

WHEN  conditional  DO  i dent  i  f i er  ■  express  1  on  e.g.,  WHEN  i*2  DO 
The  value  of  the  i dent i f i er  is  determined  by 
evaluating  the  condi tional .  If  it  evaluates  to  TRUE,  then 
the  expression  is  evaluated  and  used  for  the  value  of  that 
use  of  the  i dent i f i er .  If  the  condi tional  evaluates  to 
FALSE,  then  the  i dent i f i er ' s  value  is  itself,  in  effect,  the 
i dent i f i er  becomes  a  function  of  no  arguments  which  evaluates 
the  condi tional ,  and  if  TRUE,  returns  the  expression  as  its 
value.  Thus  WHEN  TRUE  DO  EXPAND=EXPANDU)G>  makes  the  atom 
EXPAND  always  evaluate  to  the  last  computed  expression, 
expanded. 


S0LVE(exp. var)  solves  the  algebraic  equation  exp  for  the  variable 
y a r .  If  exp  is  not  an  equation,  it  is  assumed  to  be  an 

expression  to  be  set  equal  to  zero.  Var  may  be  a  function 
(e.g.  F(X)),  or  other  non-atomic  expression  except  a  sum  or 
product.  It  may  be  omitted  if  exp  contains  only  one  variable. 
Exp  may  be  a  rational  function,  and  may  contain  trigonometric 
functions,  exponentials,  etc.  Its  success  may  depend  partly 
on  switches  set  by  the  user.  (see  OPTIONS) 

SOLVE' rihsl, . . . , 1 hsn] . fvl. . . . , vnl )  solves  a  system  of  linear 
algebraic  equations.  It  takes  two  lists  as  arguments.  The 
first  list  ( I hs i .  i  =  l,n)  represents  the  equations  to  be 
solved;  the  second  list  is  a  list  of  the  unknowns  to  be 
determined.  If  the  total  number  of  variables  in  the 
equations  is  equal  to  the  number  of  equations,  the  second 
argument- 1 i st  may  be  omitted.  If  the  given  equations  are  not 
compatible,  the  message  INCONSISTENT  will  be  displayed.  If 
no  unique  solution  exists,  SINGULAR  will  be  displayed.  The 
solutions  are  exact,  asuming  the  user  has  not  used  floating¬ 
point  numbers  in  his  input,  and  may  involve  symbolic 
variables.  The  solution  set  consists  of  a  list  of  numbered 
equations  and  an  index  to  the  list. 

PI  SPLAY (ex pi, . . .,expn)  prints  equations  whose  left-hand-side  is 
the  exp,  and  whose  right-hand-side  is  the  value  of  the 
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expression.  The  value  of  DISPLAY  is  a  list  of  the  labels  of 
the  equations  displayed. 

( C7)  Dl SPLAY(D3, I )@ 

(E7)  D3  =  X  +  Y 

(E8)  |  =  5 

(D9)  [E 7 ,  E8] 


PL0T(£2LB)  produces  an  asterisk-plot  of  the  expression  exp .  Exp 
may  be  of  the  form  F(X)  FOR  1<X<10/  or  F(X)  FOR  INTEGER 
1<X<10,  or  [yl, . . . , yn]  or  F(X/Y/Z).  In  this  last  case/  the 
user  will  be  asked  to  define  the  dependent  and  independent 
variables/  set  the  extra  variables  to  constants/  and  provide 
the  domain  for  the  independent  variable.  If  the  list  of  Y- 
values  is  provided/  the  user  will  be  asked  for  a  list  of  the 
corresponding  X-values. 

GRAPH (xy.fl.l  §  /  y.valS/*lflbel  ,y  Label )  graphs  the  two  sets  of  data 
points/  and  labels  the  axes  as  indicated.  The  data  points 
can  be  lists  or  indexed  sets.  The  height  and  width  of  the 
display  is  affected  by  the  parameters  of  the  user-terml na 1 , 
or  can  be  alcered  by  the  use  of  OPTIONS. 

APPEND(X/j£)  appends  the  two  lists  x  and  jy  and  returns  a  single 
list  of  the  elements  of  &  followed  by  the  elements  of  jy. 

CONS(A,.y)  returns  a  new  list  constructed  of  the  element  a  as  its 
first  element/  followed  by  the  elements  of  .y. 

OPTIONS(ALg)  is  a  tree-structured  collection  of  option-describing 
and  option  setting  programs.  The  options  concern  the  setting 
of  switches  for  disk  usage,  display,  simplification,  the 
SOLVE  program,  etc.  OPTIONSCCATEGORI ES)  lets  the  user  in  at 
the  top  level.  See  figure  A. 2  below. 
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Figure  A. 2  OPTIONS 
CATEGORIES 

category^  category  NAMES  each  followed  BY  THE  OPTIONS  IN  THAT 

CURRENT  VALUE  IS 
[BOOKKEEPING, 

[DSKUS  E,  F  I  LESIZE/RETAI  NNUM,  F  I  LENAME ,  DEV,  UNAME ,  I  NCHAR. 

OUTCHAR,  LI  NECHAR,  Tl  ME,  CATEGOP.I  ES ,  RS ET,  NOUUOl , 

SIMPLIFY,  J/ 

[SlMP,MAXPOSEX,MAXNEGEX,%EMODE,TRIGSlGN,SUBSTFLAG,%ETOLOGFLAG], 

L/  I  W  •  L  r\  I  ^ 

[NOSTAR, DERI  VAT  I VEABREV, L I NEL, SCOPEHE I GHT, SQRTF LAG, 

EXPTDI  SPFLAG]  , 

RATIONAL, 

[FULLFLAG, NOREPEAT , I NVERTFLAG, FACTORFLAG, RADSUBST, MODULUS, 
BERLEFACT , GCDSWI TCH, GCDOFF, RAT EPS  I  LON] 

SOLVE, 

[SOLVE FACTORS, SOLVE RAD CAN, SOLVEHEURS]  ] 

♦♦♦BOOKKEEPING*** 

DSKUSE 

IF  TRUE  CAUSES  OUTPUT  FILE  TO  BE  OPENED 
CURRENT  VALUE  IS  FALSE 

FI LESIZE 

^_THE  NUMBER  OF  EXPRESSIONS  WRITTEN  TO  SECONDARY  STORAGE  IN  EACH 
CURRENT  VALUE  IS  10 
RETAINNUM 

THE  NUMBER  OF  EXPRESSIONS  IN  MEMORY  JUST  AFTER  A  SECONDARY 
STORAGE  FI LE  IS  WRITTEN 
CURRENT  VALUE  IS  8 

FI LENAME 

THE  FIRST  NAME  OF  FILES  OF  EXPRESSIONS  WRITTEN  TO  SECONDARY 
STORAGE 

CURRENT  VALUE  IS  usrxyz 

(where  USERNAME  begins  with  usr,  and  where  xyz  Is  a  random 
number) 


DEV 

THE  DEVICE  USED  FOR  FILING 
CURRENT  VALUE  IS  DSK 
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UNAME 

THE  USERNAME  USED  FOR  FILING 
CURRENT  VALUE  IS  usr 


INCHAR 

THE  FIRST  LETTER  OF  THE  NAMES  OF  EXPRESSIONS  TYPED  BY  THE  USER 
CURRENT  VALUE  IS  C 

OUTCHAR 

THE  FIRST  LETTER  OF  THE  NAMES  OF  THE  VALUES  OF  OUTPUT 
EXPRESSIONS 
CURRENT  VALUE  IS  D 


LINECHAR  THE  FIRST  LETTER  OF 
INTERMEDIATE  DISPLAY  EXPRESSIONS 
CURRENT  VALUE  IS  E 


THE  NAMES  OF  THE  VALUES  OF 


TIME 

IF  TRUE  CAUSES  THE  TIME  REQUIRED  TO  EVALUATE  EACH 
(EXCLUDING  DISPLAY  TIME)  TO  BE  PRINTED 
CURRENT  VALUE  IS  FALSE 


INPUT  COMMAND 


RSET 

IF  TRUE  INTRODUCES  A  SPECIAL  DEBUGGING  MODE 
CURRENT  VALUE  IS  FALSE 

NOUUO 

IF  TRUE  INHIBITS  MODIFICATION  OF  CALL  INSTRUCTIONS,  A  DEBUGGING 
Al  D 

CURRENT  VALUE  IS  FALSE 


♦•♦SIMPLIFY*** 


SIMP 

IF  TRUE  CAUSES  AUTOMATIC  SIMPLIFICATION  OF  EVALUATED  EXPRESSIONS 
CURRENT  VALUE  IS  TRUE 


***DI SPLAY*** 


NOSTAR 

IF  TRUE  CAUSES  MULTIPLICATION  TO  BE  DISPLAYED  AS  A  SPACE 
CURRENT  VALUE  IS  TRUE 

DERIVATI VEABREV 

IF  TRUE  CAUSES  DERIVATIVES  TO  BE  DISPLAYED  AS  SUBSCRIPTS 
CURRENT  VALUE  IS  FALSE 

LINEL 

THE  LINELENGTH  USED  FOR  OUTPUT  AND  DISPLAY 
CURRENT  VALUE  IS  68 
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SCOPEHE I GHT 

THE  NUMBER  OF  LINES  USED  FOR  PLOTTING 
CURRENT  VALUE  IS  25 

SQRTFLAG 

IF  TRUE/  DISPLAYS  SQRT  AS  SQRT.  IF  FALSE/  DISPLAYS  SQRT  AS 
EXPONENT  1/2. 

CURRENT  VALUE  IS  TRUE 

EXPTD  I  SPFLAG 

IF  TRUE/  DISPLAYS  EXPRESSIONS  WITH  NEG.  EXPONENTS  USING 
QUOTIENTS. 

CURRENT  VALUE  IS  TRUE 


***RATI ONAL*** 


FULLFLAG 

IF  TRUE  CAUSES  RATS  IMP  TO  MULTIPLY  THROUGH  AND  REDUCE  TO  LOWEST 
TERMS  FORMS  LIKE  (AfB)tC 
CURRENT  VALUE  IS  FALSE 

NORf’EAT 

IF  TRUE  NO  GCDS  ARE  PERFORMED  WHEN  RE-RAT  I ONALLY  REPRESENTING  AN 
EXPRESSION 

CURRENT  VALUE  IS  TRUE 
INVERTFLAG 

IF  TRUE  CAUSES  RATSIMP  TO  REPRESENT  A|(-B)  AS  (A|B)|(-1)  THEREBY 
FACILITATING  SUBSTITUTIONS 
CURRENT  VALUE  IS  FALSE 

FACTORFLAG 

IF  TRUE  CAUSES  INTEGERS  TO  BE  FACTORED  BY  FACTOR  COMMAND 
CURRENT  VALUE  IS  TRUE 

RADSUBST 

IF  TRUE  ALLOWS  RADCAN  TO  BE  CALLED  BY  RATSUBST 
CURRENT  VALUE  IS  FALSE 

MODULUS 

IF  MODULUS  IS  A  POSITIVE  INTEGER  P  ALL  ARITHMETIC  IN  THE 
RmHONAL  FUNCTION  SYSTEM  WILL  BE  DONE  MOD  P 
CURRENT  VALUE  IS  FALSE 

BERLEFACT 

IF  TRUE  THE  BERLEKAMP  FACTORING  ALGORITHM  WILL  BE  USED  OTHERWISE 
THE  KRONECKER  ALGORITHM 
CURRENT  VALUE  IS  TRUE 
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GCDSWITCH 

IF  TRUE  THE  MODULAR  GCD  ALGORITHM 
REDUCED  PRS 

CURRENT  VALUE  IS  FALSE 


IS  USED  OTHERWISE  THE  COLLINS 


GCDOFF 

IF  TRUE  ALL  GCDS  ARE  1 
CURRENT  VALUE  IS  FALSE 


RATEPSI LON 

TOARAmNAfCSBERSE,NRRA?  C0NVERTING  PL0ATING  »INT  AMBERS 
CURRENT  VALUE  IS  1.0E-9 

♦♦♦SOLVE*** 


SOLVEFACTORS 

CURREN?  mu!  rsR,TRUE0  FACT°R  G'VEN  EXPRESS'°NRUE 

SOLVERADCAN 

CURREnI  VALUE  fsMFii.SEES  G'VEN  'XPRESS,0N  Wl™  "ADCAN 
SOLVEHEURS 

IF  TRUE  SOLVE  TRIES  VARIOUS  HEURISTICS 
CURRENT  VALUE  IS  FALSE 
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Rational  Function  Commands 


RATVARSCjtaxl,  •  •  •/^.sLLQ)  provides  a  method  for  specifying  the 

°^fring  °f  varJab1es  ,n  CRE  form.  The  most  main  variable 
will  be  yarn/  the  least  ("most  constant")  will  be  varl. 


RAT(A,yanl, . .  ./yam)  converts  the  expression  a  to  CRE  form.  The 
optiona1  Yad  serve  as  an  ordering  (as  in  RATVARS)  but  only 
within  the  scope  of  the  single  RAT  command. 


RATD I SREP(a)  converts  a  CRE  a  to  a  normal  prefix  expression. 

RATSJ^rfu^ . pin)/  FULLRATSiMP(*/*ALl . atm),  and 

are  slmPllf>ers.  Currently,  FULLRATS IMP  and 
RATSIMP  are  identical . 


FACTOR(A)  factors  a  polynomial  or  rational  function  a  (numerator 
and  denominator). 


SQFR(a)  computes  a  square-free  factorization  of  the  expression  a. 
A  number  of  special  checks  for  content  with  respect  to 
severa1  main  variables  occasionally  factors  polynomials  even 
though  the  factors  occur  singly. 

PARTFRAC(A/yfl£)  expands  a  rational  function  a  in  partial 
fractions  with  main  variable  va r . 

RATCOEF(£A£/A)  picks  out  the  coefficient  of  a  (which  may  be  a 
power,  product,  sum,  quotient,  etc.)  in  exo. 

RATSUBST(^,b,c)  substitutes  a  for  j*  in  £.  £  may  be  a  sum, 
product,  power,  etc. 


gCD(A/J£)  computes  the  greatest  common  divisor  of  a  and  a« 

D I VI DE(A/^,yLfl£)  computes  the  quotient  and  remainder  of  a  divided 
by  j y,  as  rational  functions  in  a  main  polynomial  variable, 
var. 


RESU LTANT(a, A/iiflUC)  computes  the  resultant  of  the  two  polynomials 
A  and  a,  and  eliminates  the  variable  var . 

MOD(a)  converts  the  polynomial  a  to  a  modular  representation  (mod 
MODULUS ).  X  must  be  in  only  one  variable. 

GFACTOR(a)  factors  the  polynomial  a  over  the  Gaussian  integers 
(i.  e.  with  SQRT(-l)  *  %\  adjoined) 
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The  Matching  Subsystem 

DECLARED var, ored)  declares  var  to  match  only  expressions 

satisfying  the  predicate  pred,  when  var  is  used  in  a  pattern. 

DEFMATCH( name. exp, varl . vark)  defines  a  pattern  matching 

program  wi th  name  name. 

DEFRULECname.exp, reel )  defines  a  transformation  rule  with  name 
name  which  matches  the  pattern  exp  and  transforms  it  to  the 
replacement  repl . 

APPLYlCexp. rl . rk)  (and  similarly  for  APPLY2)  applies  the 

rules  iJL,  te.,jik  to  the  expression  exp,  and  returns  the 
transformed  expression.  The  difference  between  APPLY1  and 
APPLY2  is  in  the  sequencing  through  the  expression  and  rules. 

TELLSiMP  (oat. repl )  (and  similarly  for  TELLS IMPAFTER)  changes  the 
simplifier,  so  that  in  all  subsequently  simplified 
expressions,  an  occurrence  of  the  pattern  pat  will  be 
replaced  by  the  expression  repl . 

Several  additional  predicates  and  testing  programs  are 
provided  for  use  in  constructing  patterns  and  their  predicates. 
SIGNUM(a)  returns  -1,0,  or  +1,  depending  on  whether  the  sign  of  a 
is  negative,  zero,  or  positive,  if  a  is  a  number,  this  question 
is  simple.  i f  a  i s  not  a  number,  its  slgnum  is  computed  from  the 
coefficient  of  the  leading  term  in  a  rationally  simplified 
expression  equivalent  to  a.  FREEOF(Az^)  returns  TRUE  if  ^  does 
not  depend  explicitly  on  a.  This  is  accomplished  by  searching 
through  y  for  an  occurrence  of  A/  and  assumes  that  a  is  not,  for 
example,  used  as  a  dummy  variable  of  Integration.  INT(a)  returns 
TRUE  if  a  is  an  integer.  CONSTANT(a)  returns  TRUE  if  a  Is  a 
constant.  REALNUM(a)  returns  TRUE  if  a  is  a  floating  point 
number.  RATNUM(a)  returns  TRUE  if  A  is  a  rational  number  or  an 
integer.  NUMBER(a)  returns  TRUE  if  a  Is  an  integer  or  a  floating 
point  ( real )  number. 


The  Matrix  Subsystem 

The  matrix  subsystem  currently  is  not  completely  integrated 
into  MACSYMA,  since  matrix  data  types  are  not  automatically 
handled  by  the  simplifier.  Furthermore,  the  ARRAY  facility, 
which  allows  more  general  data  types  (e.g.  hash-coded  indices 
need  not  be  integers),  is  a  separate  facility.  This  will,  we 
hope,  be  remedied  shortly. 

For  the  moment,  however,  the  following  commands  provide  a 
fairly  thorough  set  of  primitive  operations  on  matrices. 
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MATRIX  (rflwl, . .  .,££iu{n)  defines  a  rectangular  matrix  with  the 
Indicated  rows.  Each  row  has  the  form  of  a  list  of 
expressions/  e.g.  [a#  x**2#  y,  0]  Is  a  list  of  4  elements. 

ENTERMATR I X H )  allows  one  to  enter  a  matrix  element  by  element 
with  the  computer  asking  for  values  for  each  of  the  m  by  a 
entries. 

EMATRIXCiB/ji/A/i/l)  returns  an  m  by  n  matrix  with  all  entries  zero 
except  for  the  (1/1)  entry#  which  Is 

D I AGMATR I  X(ji/2i)  returns  a  diagonal  matrix  of  size  n  by  n  with  the 
diagonal  elements  all  j&.  An  Identity  matrix  Is  created  by 
D I  AGMATRI  X(ji/ 1 ) ,  or  one  may  use  the  next  command. 

IDENT(a)  produces  an  n  by  n  Identity  matrix. 

SETELMX(ji,l,l,fl])  creates  a  new  matrix  which  Is  Identical  to  the 
matrix  m  except  that  Its  (1/1)  element  Is  £. 

R0WX(m,D  creates  a  new  matrix  which  Is  the  1th  row  of  the  matrix 

m. 

C0LX(ni/l)  creates  a  new  matrix  which  Is  the  1th  column  of  the 
matrix  m. 

TIMEX(ml/ .  ../Ml)  multiplies  two  or  more  matrices  (or  scalars  and 
matr I ces) . 

ADDX(ml/.../mQ)  adds  two  or  more  matrices. 

D I FFERENCEX(ml/ml)  computes  ml  “  m2 . 

P0WERX(m/l)  computes  the  1th  power  of  the  matrix  m 

INVERX(m)  Inverts  the  matrix  m. 

TRANSX(jd)  produces  the  transpose  of  m. 

ECHELON(iq)  produces  the  echelon  form  cf  m* 

Ml  N0RX(ni/l/l)  computes  the  1/1  minor  of  the  matrix  m 

DETERM  I  NANT(ni)  computes  the  determinant  of  m. 

CHARPOLX(m/yar)  computes  the  characteristic  polynomial  for  m* 

That  ls#  DETERM  I  NANT(D  I  F  FERENC  EX  (m.  D  I  AGMATR  IX(var.«;lzp  of 
m) ) . 
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SUBMATRI  X(iq1/ .  . . /ihq/  nl,  ...,jm)  creates  a  new  matrix  composed 
of  the  matrix  M  with  its  ni_L  rows  deleted/  and  its  ill  columns 
deleted. 


The  Power  Series  Subsystem 

The  power  series  subsystem  is  divided  into  two  parts. 
The  first  handles  truncated  power  series/  and  the  second 
manipulates  summations  in  their  general  form.  The  code  is 
still  being  changed  to  accomodate  different  needs  as  they 
appear  in  applications/  so  that  the  following  description  is 
liable  to  need  revision.  This  subsystem  is  not/  at  the 
moment/  particularly  well  Integrated  into  the  MACSYMA  system, 
in  the  sense  that  there  are  no  calls  from  the  simplifier  to 
these  programs. 


TAYLOR(£Afi/Yax./fil/ J22&)  expands  the  expression  exp  in  a  truncated 
Taylor  series  in  the  variable  var  around  the  point  ££.  The 
terms  through  (*ar.-.£i)**eflM  are  generated. 

PS(£Z£/Yai:/£l/£fttt)  resembles  TAYLOR,  except  that  the  internal 

form  of  the  expression  is  a  special  form  especiaHy  suitable 
for  manipulation  as  a  truncated  expression.  Such  expressions 
can  be  manipulated  with  the  programs  PSPLUS,  PSMiNUS, 

PSTIMES,  PSEXPT,  and  PSDERIV  (for  adding,  negating,  multi¬ 
plying,  raising  to  a  power,  and  differentiating, 
respectively). 

POWERS ER I  ES attempts  to  generate  the  general  form  of 
the  power  series  expansion  for  exo  in  the  variable  yar  about 
the  point  J2±  (which  may  be  INF,  for  infinity). 

A  large  table  of  general  expansions  is  now  on  a  MACSYM  disk 
file,  including  hyberbolic,  hypergeometric,  and  various  other 
special  functions. 


Miscellaneous  Utility  Commands 

This  section  includes  a  few  miscellaneous  controls  that  the 
user  has  over  the  MACSYMA  system.  Some  of  these  are  not  commands 
in  the  strict  sense,  since  they  do  not  require  a  to  take 
effect . 


Control  -B  (written  £)  "seizes"  the  lineprinter,  if  it  is 
available,  and  outputs  future  lines  (in  tandem  with  the  console) 
on  the.  1  inepr  inter .  t  releases  the  lineprinter.  1?  prints  on  the 
next  line  the  contents  of  the  input  buffer.  This  is  useful  when 
several  characters  have  been  deleted,  to  clarify  current  input 
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line.  L/  on  Cathode  Ray  Tube  consoles/  clears  the  screen  and 
resumes  printing  on  the  first  line  (as  in  “K). 

MACSYMA  is  loaded  Into  a  LISP  system.  In  the  course  of 
running  programs/  it  is  occass i ona 1 1 y  useful  to  call  some  LISP 
program  directly.  In  order  to  read  and  evaluate  one  LISP  s- 
expresslon/  one  has  the  command  EVALO.  One  may  "quit"  out  of  a 
loop  and  return  to  the  supervisor  by  typing  K.  II  has  the 
effect  of  "breaking"  at  a  point  in  execution/  and  allowing  a  user 
to  examine  the  depth  of  program  nesting/  the  values  of  Internal 
variables/  etc.  $P  <space>  resumes  execution. 

2  causes  an  exit  from  MACSYMA  to  LISP.  The  LISP  expression 
(CONTINUE)  returns  to  the  MACSYMA  supervisor. 

When  within  MACSYMA,  it  is  sometimes  handy  to  read  in  files 
of  LISP  programs.  (Note  that  BATCH  handles  only  MACSYMA  Input, 
not  LISP  files)  The  following  command  help  one  to  do  this. 


LOADFI LE(fnl,fn2. device. username)  loads  a  file  as  described  by 
its  arguments. 

The  next  few  commands  help  file  away  MACSYMA  results. 

WRITEFI LE( dev  ice. username)  opens  a  file  for  writing. 

CLOSEF ILE(fnl.fn2)  closes  the  file. 

PLAUBACK()  "plays  back"  all  the  input  and  output  lir^s  since 

(Cl).  PLAYBACK(n)  plays  back  the  last  n  expressions  (Cl,  Di, 
and  Ei  count  as  1  each).  Ordinarily  this  would  be  done 
between  a  WRITEFI LE  and  CLOSEFILE  to  store  a  neat  set  of 
commands  and  responses  or  to  refresh  one's  memory  as  to  what 
he  has  already  done  (especially  at  a  CRT  console). 


Comments  are  welcome,  and  should  be  directed  to  Richard  Fateman. 
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Appendix  ii 

The  Polynomial  and  Rational  Function  Package  in  MACSYMA 

ii.l.  introduction 

This  appendix  describes  a  series  of  LISP  routines  for  ma¬ 
nipulating  sparse  polynomials  and  rational  functions  In  several 
variables.  This  description  Is  Intended  as  a  detailed  glide  to 
the  Implementation  and  assumes  some  familiarity  with  LISP  and  the 
concept  of  rational  expressions.  A  user-or ? anted  view  of  these 
facilities  may  be  found  in  the  previous  appendix. 

The  polynomial  programs  were  originally  written  by  W.  A. 
Martin.  They  were  debugged,  and  In  some  cases,  rewritten,  by  R. 
Fateman;  the  rational  function  and  conversion  programs,  the 
modular  greatest-common-divisor  (gcd)  algorithm  (4),  and  the 
lower  level  modular  arithmetic  routines  were  programmed  by  R. 
Fateman.  The  Berlekamp  factoring  algorithm  (2)  was  Implemented 
by  L.  Rothschild. 

Because  these  routines  are  written  entirely  In  LISP 
they  can  be  exported  to  other  LISP  systems  very  simply.  If  they 
are  to  Interface  with  another  LISP-based  algebraic  manipulation 
system,  the  only  programs  that  need  to  be  altered  are  those  which 
convert  to  and  from  rational  expression  form.  The  programs  are 
written  so  as  to  allow  complete  symbolic  algorithms  to  be 
composed  entirely  within  the  rational  function  package. 

It  Is  well  known  that  any  rational  function  can  be  written 
as  the  ratio  of  two  polynomials  with  integer  coefficients  and 
with  no  common  polynomial  divisors.  With  the  added  provisions 
that  the  denominator  be  ,">si t ive  and  that  the  polynomials  be  rep¬ 
resented  In  a  canon  I ca ’  f'vm  (such  as  recursive  In  the  variables 
In  some  fixed  order),  wa  can  produce  a  canonical  form  for  any 
rational  expression.  MACSYMA  has  a  special  Internal  repre¬ 
sentation  for  canonical  rational  expressions  (CREs)  which  has 
many  useful  properties,  the  most  important  of  which  is  that  this 
representation  will  map  any  set  of  functionally  equivalent 
rational  expressions  Into  a  unique  (canonical)  representation. 

11.2.  Representation  of  Canonical  Rational  Expressions 

A  rational  function  in  canonical  rational  expression  (CRE) 
form  is  represented  at  the  top  ("MEVAL")  level  of  MACSYMA  by  the 
form 

( (MRAT  SIMP  varllst  assoclist)  polyforml  .  poiyform2). 
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Each  polyform  is  a  list  of  the  form  (malnvarlable,  highest- 
exponent,  coefficient,  next-highest  exponent,  coefficient,  ...). 
Pol yforml  is  the  numerator,  polyform2  Is  the  denominator. 

When  the  coefficient  of  a  term  Is  zero,  the  exponent-coef¬ 
ficient  pair  is  omitted.  This  makes  the  representation  attrac¬ 
tive  for  sparse  polynomials.  The  coefficients  may  themselves  be 
polyforms  In  variables  with  a  lower  order,  or  may  be  numbers. 

This  recursive  property  makes  this  representation  suitable  for 
polynomials  In  any  number  of  variables,  and  makes  programming 
particularly  simple  in  LISP.  The  leading  coefficient  in 
polyform2  is  positive,  and  the  greatest  common  divisor  of  poly- 
forml  and  polyform2  Is  1.  The  ordering  of  variables  on  VARLIST 
determines  which  is  the  main  variable,  and  which  (recursively) 
are  main  variables  of  the  coefficients  of  the  main  variable. 

By  altering  the  definitions  of  CPLUS,  CM1NUS,  CTIMES, 

CEXPT,  CQUOTi ENT,  CD  I FFERENCE,  CFACTOR,  CDERIVAT1VE,  CGCD,  and 
PCOEFP,  the  non-polyform  coefficient  arithmetic  can  be 
re Impl emented  with  a  number  of  different  domains.  Currently,  the 
coefficient  arithmetic  Is  the  domain  of  arbitrary  precision 
integers. 

If  the  global  variable  MODULUS  is  set  to  a  positive  prime 
number  R,  rather  than  Its  default  value  of  NIL,  all  coefficient 
arithmetic  is  performed  modulo  R.  in  such  a  case,  the  repre¬ 
sentatives  of  the  field  have  values  between  -R/2  and  R/2. 

Alternate  coefficient  routines  have  been  implemented  by  R. 
Zippel  which  (along  with  minor  changes  in  the  rest  of  the 
programs)  allow  the  coefficients  to  be  rational  numbers,  or 
rational  functions.  Additionally,  he  has  written  programs  which 
automatically  truncate  their  results. 

A  set  of  special  coefficient  routines  which  have  "counters" 
in  them  is  also  available.  This  Is  convenient  for  examining  the 
number  of  coefficient  operations  required  for  execution  of  a 
program. 

Among  these  coefficient  routines,  the  only  one  whose 
purpose  Is  not  obvious  is  PCOEFP.  PCOEFP  is  a  predicate  which 
returns  T  when  applied  to  a  member  of  the  coefficient  domain 
(presently,  LISP  Integers),  NIL  for  a  member  of  the  polynomial 
domain  (i.e.  polyforms). 

Zero  could  consistently  be  treated  as  an  empty  polynomial 
or  as  a  zero  coefficient.  For  our  purposes  it  Is  most  convenient 
to  express  zero  by  0. 
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Returning  to  the  generalized  form  for  a  CRE,  we  see  that  it 
is  to  our  advantage  to  have  a  rapid  method  for  testing  whether 
"x"  is  more  of  a  main-variable  than  "y"  (etc.).  In  order  to 
compare  two  variables  on  the  VARLIST  quickly,  each  variable  is 
associated  with  a  generated  symbol  (they  look  like  G001,  G002, 
...)  and  the  generated  symbols  are  ordered  by  their  values. 
POINTERGP  is  used  as  an  order-testing  predicate.  The  generated 
symbols  are  used  in  the  body  of  the  polyforms,  and  the  varllst 
and  assoclist  are  used  only  In  conversion  to  and  from  CRE  form. 
Examples 

Assume  the  POINTERGP  ordering  of  3  generated  symbols  G001, 
G002,  and  G003  is  In  that  order. 

3/4  »  ( (MRAT  SIMP  NIL  NIL)  3  .  4) 

X/3  -  ((MRAT  SIMP  (X)  (GOOD)  (G001  1  1)  .  3) 

3/X  -  ((MRAT  SIMP  (X)  (GOOD)  3  G001  1  1) 

(X2+XY)/Z  -  ((MRAT  SIMP  (Z  Y  X)(G003  G002  GOOD) 

(G001  211  (G002  1  1)) 

G003  1  1) 


Polynomials  are  written  with  polyform2  ■  1 
3  -  ((MRAT  SIMP  NIL  NIL)  (3  .  D) 

7X  -  ((MRAT  SIMP  (X)(G001))  ((G001  1  7)  .  D) 

0  -  ((MRAT  SIMP  NIL  NIL)  (0  .  D) 


Note  that  both  the  assoclist  and  varllst  may  contain  var¬ 
iables  not  actually  in  the  expression.  Thus 

3  -  ((MRAT  SIMP  (X  Y  ZMG003  G002  GOOD  (3  .  1)) 

is  valid. 


11.3.  Polynomial  Functions 

In  this  section,  X  and  Y  are  expressions  with  the  same 
ordering  of  variables. 

PCOEFP(X)  returns  T  if  X  is  a  member  of  the  coefficient  domain, 
i .e.  a  number  or  NIL. 
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PPLUS(X,Y)  adds  fcwo  polyforms  to  yield  a  polyform. 

PTIMES(X,Y)  multiples  two  polyforms  to  yield  a  polyform. 

PQUOTI ENT(X,Y)  divides  X  by  Y  to  yield  a  polyform;  signals  an 
ERROR  if  the  remainder  is  not  zero. 

PDIVIDE(X,Y)  yields  a  list  of  two  RATforms:  the  quotient  and  the 
remainder  with  respect  to  the  main  variable  of  X,  of  X 
divided  by  Y. 

PDI FFERENCE(X,Y)  yields  the  value  of  X-Y. 

PDER I  VAT  I VE(X, VAR)  yields  the  polyform  equal  to  the  formal  deriv¬ 
ative  of  X  with  respect  to  the  variable  VAR,  which  need  not 
be  the  main  variable  of  X. 

PEXPT(X,N)  raises  the  polyform  X  to  the  power  N,  which  must  be  a 
non-negative  LISP  integer.  It  uses  a  modified  multinomial 
expansion  technique  which  is  generally  superior  to  other 
methods  (13). 

OLDGCD(X,Y)  yields  the  polynomial  greatest  common  divisor  of  X 
and  Y.  Collins'  reduced  PRS  algorithm,  described  in  (27),  p. 
372  is  used. 

OLDCONTENT(X)  yields  a  list  of  the  (positive)  content  of  X  and 
the  primitive  part  of  X.  X  is  considered  to  be  a  polynomial 
in  one  variable  with  coefficients  that  are  polynomials  in  the 
other  variables.  Thus  the  content  of  xyz+xy  with  x  the  main 
variable  is  gcd(yz,y)  or  y,  while  the  primitive  part  is  xz+x. 
This  definition  is  used  for  the  Collins  algorithm. 

PGCD(X,Y)  yields  the  polynomial  greatest  common  divisor  of  X  and 
Y.  The  modular  algorithm  described  by  Brown  in  (4)  is  used. 

It  calls  PGCDM,  PGCDP,  and  PGCOU  corresponding  to  algorithms 
M,  P,  and  U  in  (4). 

PCONTENT(X)  yields  a  list  of  the  (positive)  content  of  X  and  the 
primitive  part  of  X.  X  is  considered  to  be  a  polynomial  in 
many  variables  with  integer  coefficients.  The  content  is 
always  an  integer.  Thus  the  content  of  xyz+xy  is  1,  with 
primitive  part  xyz+xy.  This  definition  is  used  for  the 
modular  GCD  algorithm  (PGCO). 

PM0DC0NTENT( X)  is  a  peculiar  type  of  content  calculation  required 
by  the  multvariate  modular  gcd.  X  is  considered  to  be  a 
polynomial  whose  coefficients  are  polynomials  with  modular 
coefficients  in  one  variable  (the  main  variable  in  X).  Thus 
the  content  of  xyz+xy  for  main  variable  x,  is  x,  with  primi- 


180 


tive  part  yz+y. 

PMINUS(X)  yields  -X/  0  if  X  is  0. 

PMiNUSP(X)  yields  T  if  the  leading  coefficient  of  X  Is  negative, 
otherwise/  NIL. 

P I NTERPOLATE ( L/ VAR)  finds  an  interpolation  polynomial  given  a  set 
of  points.  L  is  a  list  of  n  values  (integers  or  poly- 
nomisls)  of  a  polynomial  P  to  be  found  by  Interpolation  at 
the  points  0/  1/  ..,/  n.  VAR  is  the  main  variables  of  P. 
PINTERPOLATE  returns  P  only  If  p  has  Integer  coefficients/ 
and  signals  an  ERROR  otherwise.  PINTERPOLATE  is  used  by 

DC  A  TTOH  " 


PCSUBST(X/ VAL/VAR)  substitutes  the  number  VAL  for  the  variable 
VAR  in  the  polyform  X. 

PFACTOR(X)  returns  a  list  of  items  consisting  of  positive  (except 
possibly  -1)  primitive/  irreducible  (over  integers)  factors 
of  X,  followed  in  each  case  by  the  degree  (multiplicity)  of 
that  factor.  Berlekamp's  algorithm  (2)  is  used.  PFACTORwill 
not  factor  with  respect  to  variables  whose  generated  symbol 
is  on  the  list  $D0NTF ACTOR.  If  SFACTORFLAG  Is  set  to  NIL/ 
the  integer  part  will  not  be  fa'.tored.  For  example,  with 
$FACTORFLAG  set  to  T: 

PFACTOR  ((X  4  2))  *  (2  1  (X  1  1)  4) 

PFACTOR  (0)  «  (0  1) 

PFACTOR  (1)  -  (1  1) 

PFACTOR  (-3)  -  (-1  1  3  1) 

PFACTOR  (-1)  b  (-1  1) 

PFACTOR  ((X  4  (Y  4  G))>  ■*  (3  1  2  1  (X  1  1)  4  (Y  1  1)  4). 

PSQFR(X)  is  the  same  as  PFACTOR  except  the  polynomials  are  not 
necessarily  irreducible/  just  squarefree  (22/  p.  381). 

PMOD(X)  returns  the  polynomial  X  with  its  coefficients  reduced 
mod  MODULUS.  If  MODULUS  is  NIL/  X  is  returned  unchanged. 

Ii.4.  Rational  Functions 

Hare  X  and  Y  denote  ratforms.  A  ratform  is  a  dotted  pair 
of  polyforms/  that  is,  the  CDR  of  an  MEVAL-level  CRE.  All 
results  are  in  lowest  terms.  The  denominator  is  always  positive. 
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Division  by  zero  will  cause  a  LISP  ERROR. 


RATPLUS(X,Y)  performs  addition:  X  ♦  Y. 

RATTIMES( X, Y,SW)  performs  multiplication:  X*Y.  If  SW  is  NIL,  it 
will  be  assumed  that  if  X*a/b  and  Y»c/d,  that  the  greatest 
common  divisor  of  a*c  and  b*d  is  1.  In  situations  where  an 
expression  is  repeatedly  converted  to  ORE  form,  a  consi¬ 
derable  amount  of  time  can  be  saved  by  not  repeating  the 
g.c.d.  calculations. 

RATQUOT I  ENT ( X, Y)  performs  division:  X/Y. 

RATI NVERT(X)  Inverts  X:  1/X. 

RATMINUS(X)  performs  negation:  -X. 

RATD I F ( X, Y )  performs  subtraction:  X-Y. 

RATEXPT( X, N)  raises  X  to  the  Nth  power,  where  N  is  a  (possibly 
negative)  LISP  integer. 

RATREDUCE( P,Q)  takes  two  POLY forms  P  and  Q  and  rtduces  them  to 
lowest  terms,  RATREDUCE  Is  used  when  needed  by  the  other 
rational  functions,  except  as  noted  in  RATTIMES.  RATREDUCE 
returns  P/Q  as  a  ratform. 

RATFACT(X)  factors  the  numerator  and  denominator  of  the  ratform 
X,  and  returns  a  list  similar  to  that  returned  by  PFACTOR, 
except  that  the  factors  of  the  denominator  will  have  negative 
multiplicities. 

RATABS(X)  returns  the  absolute  value  of  the  ratform  X. 

RATDERI V( X, VAR)  returns  the  formal  derivative  of  the  ratform  X 
with  respect  to  VAR. 

RATGCM( X, Y )  returns  the  greatest  common  multiple  of  X  and  Y.  If 
X  ■  a/b  and  Y  ■  c/d,  then  gcm(X,Y)  ■  gcd(a,c)  * 
gcd(b,d)/(b*d) . 


11.5.  Conversion  Functions 


$RAT(M)  uses  NEWVAR,  described  below,  to  put  non-rational 

stbepress Ions  of  M  on  a  variable-list  (VARLIST),  and  then 
calls  RATREP(M, VARLIST)  as  described  below.  Floating  point 
numbers  are  converted  to  rational  numbers  (within  a  relative 
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error  of  $RATEPSILON,  set  by  the  user). 


RATREP(M, VARLIST)  given  a  non-CRE  MACSYMA  expression  M  creates 
the  appropriate  CRE  form.  Including  the  (MRAT  SIMP  varilst 
assoclist)  prefix.  All  the  variables  and  non-ratlonal  ex¬ 
pressions  on  VARLiST  will  be  associated  with  generated 

25b2lJi.Ji,Ste*  °H  assoc,!lst-  An  the  non-ratlonal  elements 
b  ,n  M  must  be  on  the  llst  VARLIST  given  to 
RATREP.  oee  the  discussion  of  NEWVAR  for  how  this  may  be 
done. 


$RATD I SRE PC X )  converts  the  CRE  X  into  a  standard  MACSYMA  prefix 
expression.  A  RATSiMP  flag  Is  put  on  each  of  the  operators. 
Extraneous  parts  of  the  expression  (such  as  multiplication  by 
one,  nested  MPLUSs  or  MTIMESs,  exponentiation  by  0  or  1)  are 
edited  out  automatically. 


Example: 


2X  +3XY+1  ■  ((MRAT  SIMP  (Z  Y  X)(G003 
(G001  220  (G002  1  (G003  130  1)))) 
to 


G002  GOOD  )  ( (G001  2  2 
.  D)  is  SRATDISREP'd 


0 


( (MPLUS  RATSIMP) ( (MTIMES  RATSiMP)  2  ( (MEXPT  RATSIMP)  X  2)) 
( (MTIM  RATSIMP)  3  Z  Y)  1)  * 


NEWVAR(X)  examines  an  expression  M  (a  MACSYMA  prefix  form  or  a 
CRE,  or  a  sum,  product,  difference,  or  quotient  of  the  two), 
and  constructs  a  list  L  of  non-ratlonal  components  and 
variables  which  are  not  already  on  the  global  VARLIST.  That 
part  of  L  collected  from  CREs  Is  placed  on  the  VARLIST  In  as 
nearly  the  same  order  as  in  the  CREs  as  possible.  The  rest 

°Jnfi t5e  func5Ion  GREAT  (by  the  function  SORT) 

D  JN  f  t0  th*  of  VARLIST.  A  subsequent  call  to 

RATREP  with  arguments  M  and  the  global  VARLIST  will  then 
produce  the  CRE  corresponding  to  M. 

ORDERPOI NTER( L)  sets  up  the  correspondence  between  generated 
szt°  5  f Jd,tbe  expressions  on  the  list  L,  generating  new 
aymbols  if  L  Is  longer  than  any  previous  VARLIST.  These 
generated  symbols,  placed  on  the  CDR  of  the  value  of  the 

variable  GENVAR,  also  form  the  assoclist  for  a  newly  RATREP'd 
express  Ion . 


li.G.  Command- level  Programs 


SRATSIMP(X)  conver 
subexpress l ons 
the  effect  of 


ts  the  expression  X  and  all  its  non-ratlonal 
into  CRE  form,  and  then  back  again.  This  has 
rationally  simplifying  the  expression  X  and  al 


l 
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its  non-rational  expressions. 


$RESULTANT( X)  computes  the  resultant  of  the  polynomial  X,  using  a 
polynomial  remainder  sequence  technique. 

$GCD(X,Y)  computes  the  greatest  common  divisor  of  the  two 
pol ynomi al s  X  and  Y. 

$Di V i DE( X, Y, V)  calls  PDiVIDE  on  the  two  polynomials  X  and  Y, 
whose  main  variable  is  V. 


$FACTOR(X)  factors  the  numerator  and  denominator  of  the  rational 
function  X  over  the  integers. 

$GFACTOR( X )  factors  the  polynomial  X  over  the  Gaussian  integers 
(with  sqrt(-l)  ■  %\  adjoined). 


$M0D( X)  converts  the  single-variable  polynomial  X  to  modular 

representat ion,  accessing  the  value  of  MODULUS  set  globally. 

11.7.  Other  Notes 


Efficiency  can  be  considerably  improved  if  the  knowledge 
that  a  quotient  is  in  lowest  terms  can  be  preserved.  If  the 
global  flag  NOREPEAT  is  set  to  T,  then  any  MQUOTi ENT  or  MTIMES 
with  a  RATSIMP  flag  (placed  there  by  $RATD I SREP)  will  not  be 
RATREDUCE  d  as  it  is  being  converted  to  CRE  form  by  RATREP.  it 
may  occasionally  be  useful  to  set  NOREPEAT  to  NIL/  since  F.  «  (x- 
is  reduced,  but  if  z  *  x  ,  E  can  be  reduced  further  on 
a  second  pass  by  treating  it  as  (za -l)/(z  +  l) . 


Setting  $GCDSW|TCH  to  NIL  replaces  the  modular  GCD 
algorithm  with  the  reduced  PRS  algorithm.  Setting  SGCDOFF 

?ncnl>]rS^the  ccd  rout'nes  entirely  (all  geds  are  1).  Setting 
$BERLEFACT  to  NIL  replaces  the  Berlekamp  factoring  algorithm  with 
the  less  efficient  Kronecker  algorithm. 


Listings  of  these  functions  and  their  subroutines  are  are 
available  from  the  author. 
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